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; Search time 4437 Seconds 

(without alignments) 

11338.802 Million cell updates/sec 


Title: US-09-869-414A-3 
Perfect score: 2070 

Sequence: 1 atggcccaagccctgccctg aaaaaaaaaaaaaaaaaaaa 2070 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1,0 

Searched: 22781392 seqs, 12152238056 residues 

Total number of hits satisfying chosen parameters: 45562784 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
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em_estba : * 
em_esthum: * 
em_estin: * 
em_estmu: * 
em_estov: * 
em__estpl : * 
em_estro : * 
em_htc : * 
gb_estl : * 
gb_est2 : * 
gb_htc:* 
gb_est3 : * 
gb_est4 : * 
gb_est5 : * 
em_estfun: * 
em_estom: * 
em_gss_hum: * 
em_gss_inv: * 
em_gss_pln: * 
em_gss_vrt : * 
em_gss_fun: * 
em_gss_mam: * 
em__gss_mus : * 
em_gss__pro: * 
em_gss_rod: * 
em__gss__phg : * 
em__gss_vrl : * 


28: gb_gssl:* 
29: gb_gss2:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 
AK014464 
LOCUS 

DEFINITION 


ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 


REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 


TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 


AK ° 14464 3859 b P mRNA linear HTC 05-DFC-?nn9 

Mus musculus 16 days embryo head cDNA, RIKEN full-length enrichfd 

fuU^^ Product :beta-site APP cleaving en ^ 

run insert sequence. 1 

AK014464 

AK014464.1 GI:12852334 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi • 
MammaHa; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus . 

Carninci,P. and Hayashizaki, Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

^ r £ i £ Cl/P "' Shibata ' Y - Hayat SU ,N., Sugahara,Y., Shibata K 
Itoh,M., Konno,H Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 

TeZrlT ^Y nd .r btraCti ° n ° f Ca P-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10), 1617-1630 (2000) 9 

20499374 

11042159 

3 

Shibata, Kltoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci P 

'A y T' J " Nishi,K., Kitsunai, T ., Tashiro,H., Itoh^" 
Sunu,N Is hll ,Y., Nakamura,S., Hazama,M., Nishine,T., Harada A 
YamamotcR., Matsumoto, H. , Sakaguchi, S . , Ikegami,T , Kashiwagi k' 

Yonedf y' i IT*'**' T °^ Y - Izawa ' M -' Ohara,^, Wa tahl££; 

olT, I V I 5 hlkaWa ' T " Ozawa # K., Tanaka,T., Matsuura,S., Kawai J 

MKEN ^;' M "" matSU ' M -' moue,Y. f Kira,A. and Hayashizaki, Y ' ' 

RIKEN integrated sequence analysis (RISA) system-384-f ormat 

sequencing pipeline with 384 multic ap ill ary sequencer 

Genome Res. 10 (11), 1757-1771 (2000) 

20530913 

11076861 

4 

Kawai,J., Shinagawa,A., Shibata, K., Yoshino,M., Itoh,M. 
Arakawa,T., Hara,A., Fukunishi, Y. , Konno,H., Adachi,J., 

^itoV"nJ ZaW >' M ;' Nlshi ' K " Kiyosawa,H., Kondo,S., Yamanaka,!., 
Saito,T Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito R 

SeL c ; ' T SU ? a ' H " A3hb — ' M -' Batalov.S., Casavant, T. ' ' ' 

* an "' W :' Gaasterland,T., Gissi,C, King,B., Kochiwa,H , 
Kuehl,P Lewis, S., Matsuo,Y., Nikaido,I., Pesole,G., 
Quackenbush J Schriml^.M., Staubli,F., Suzuki,R., Tomita,M., 
Wagner,L. Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H , 
Baldarelli,R., Barsh,G., Blake, J. f Boffelli,D., Bojunga,N , 
Carninci, P., de Bonaldo,M. F. , Brownstein,M. J. Bult,C 


, Ishii,Y. 
Fukuda, S . 


Fletcher, C, Fujita,M., Gariboldi,M. , Gustincich, S . , Hill,D., 
Hofmann,M. , Hume, D. A., Kamiya,M., Lee, N. H. , Lyons, P., 
Marchionni, L. , Mashima,J., Mazzarelli, J* , Mombaerts , P . , Nordone,P., 
Ring,B. f Ringwald,M. , Rodriguez, I . , Sakamoto, N., Sasaki, H., 
Sato,K., Schonbach,C. , Seya,T., Shibata,Y., Storch,K.F., Suzuki, H . , 
Toyo-oka,K., Wang,K.H., Weitz,C, Whittaker , C . , Wilming,L., 
Wynshaw-Boris, A. , Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
and Hayashizaki, Y. 

TITLE Functional annotation of a full-length mouse cDNA collection 

JOURNAL Nature 409 (6821), 685-690 (2001) 

MEDLINE 21085660 
PUBMED 11217851 
REFERENCE 5 

AUTHORS The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

TITLE Analysis of the mouse trans criptome based on functional annotation 

of 60,770 full-length cDNAs 

JOURNAL Nature 420, 563-573 (2002) 
REFERENCE 6 (bases 1 to 3859) 

AUTHORS Adachi,J., Aizawa,K., Akahira,S., Akimura,T., Arai,A., Aono,H., 
Arakawa,T., Bono,H., Carninci,P., Fukuda,S., Fukunishi, Y. , 
Furuno,M., Hanagaki,T., Kara, A., Hayatsu,N., Hiramoto,K., 
Hiraoka,T., Hori,F., Imotani,K., Ishii,Y., Itoh,M., Izawa,M., 
Kasukawa,T., Kato,H., Kawai,J., Kojima,Y., Konno,H., Kouda,M. , 
Koya f S., Kurihara,C, Matsuyama, T . , Miyazaki,A., Nishi,K., 
Nomura, K. , Numazaki,R., Ohno,M. f Okazaki,Y., Okido,T., Owa,C, 
Saito,H., Saito,R. f Sakai,C, Sakai,K., Sano,H., Sasaki, D., 
Shibata,K., Shibata,Y., Shinagawa, A. , Shiraki,T., Sogabe,Y., 
Suzuki, H. f Tagami,M. f Tagawa,A. , Takahashi , F. , Tanaka,T., 
Tejima,Y., Toya,T., Yamamura,T., Yasunishi f A. , Yoshida,K., 
Yoshino,M., Muramatsu,M. and Hayashizaki , Y. 

TITLE Direct Submission 

JOURNAL Submitted ( 10- JUL-2000) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome-res @gsc . riken . go . jp, 
URL :http: //genome. gsc. riken. go. jp/, Tel : 81-45-503-9222, 
Fax:81-45-503-9216) 

COMMENT Please visit our web site (http://genome.gsc.riken.go.jp/) for 

further details. 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN . 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. First strand cDNA was primed with a primer 
[5* GAG AG AG AG AAG GAT C C AAG AG CTCTTTTTTTTTTTTTTT T VN 3*], cDNA was 
prepared by using trehalose thermo-activated reverse transcriptase 
and subsequently enriched for full-length by cap-trapper. Second 
strand cDNA was prepared with the primer adapter of sequence [5* 
GAG AG AG AG AT T C T C G AGT T AAT T AAAT T AAT CCCCCCCCCCCCC 3 ' ] . cDNA was cleaved 
with BamHI and Xhol . cDNA of size comprised longer than 7 kb was 
selected before cloning. Vector: a modified pBluescript KS(+) after 
bulk excision from Lambda FLC I. Cloning sites, 5* end: Sail; 3 1 
end: BamHI. Host: DH10B. 
FEATURES Location/Qualifiers 
source 1. .3859 


/organism="Mus mus cuius" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/ db_xr e f = " FANTOM_DB : 4 1 2 2 4 0 1 C 0 4 " 

/db_xref="MGI: 1896726" 

/db_xref="taxon: 10090" 

/clone="4122401C04" 

/tissue_type="head" 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/dev_stage="16 days embryo" 
CDS 430. .1935 

/note="unnamed protein product; beta-site APP cleaving 

enzyme (MGD | MGI : 134 6542, GB | NMJU1792 , evidence: BLASTN, 

98%, match=3874) 

putative" 

/codon_start=l 

/protein_id="BAB29370 . 1" 

/db_xref="GI: 12852335" 

/db_xref="MGI: 1346542" 

/ trans lation="MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLAGPPLGIjRLP 
RET DE E S EE P GRRG S FVEMVDN LRGK S GQG Y YVEMT VG S P P QT LN I LVDT G S S N FAVG 
AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANI 
AAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGA 
GFPLNQTEALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKM 
DCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAG 
TTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGT 
VMG AVI MEG F YWFD RARK RI G FAVS AC HVH D E F RT AAVE G P FVT ADME D C G YN I P QT 
DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLLK" 

BASE COUNT 873 a 1082 c 1028 g 876 t 

ORIGIN 

Query Match 75.3%; Score 1559.6; DB 11; Length 3859; 

Best Local Similarity 87.5%; Pred. No. 6.2e-193; 

Matches 1798; Conservative 0; Mismatches 244; Indels 12; Gaps 8; 


Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I III MM I I I I I II M II I III I II II I II I I I I I I I II II I II I 

Db 430 ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGCTCGGGAATGCTGCCTGCCCAG 489 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

II II I II I M I II I II M II I I II M I I II II I II I I I I II I I II M II I 

Db 4 90 GGAACCCATCTCGGCATCCGGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 54 9 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

III II I I I M II II II II II I I I I I II I I II II II I II II I I II I M I I M I I 

Db 550 C T GAG GCT GC C CC GGGAGAC C GAC GAGGAAT CGGAGGAGC CT GGCC GGAGAGGCAGCT TT 609 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

I I I II I I I M I II II II II II I I I II II I II II M II M II II I II II II I I II I I 

Db 610 GTGGAGATGGTGGACAACCTGAGGGGAAAGTCCGGCCAGGGCTACTATGTGGAGATGACC 669 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

II II II I I M II I II II I I II II I I II I II II II II II Mill II I II II II I I I 

Db 670 GTAGGCAGCCCCCCACAGACGCTCAACATCCTGGTGGACACGGGCAGTAGTAACTTTGCA 729 


Qy 


301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 
I II II I II II I I I II I I I II II I I I I II II I I II II II II I M I II I M I II II II I 


Db 730 GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 789 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

II II I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II | I I I | M I I I I I 
Db 7 90 TAT CGAGACCT CCGAAAGGGTGTGTAT GT GC CCTACACCCAGGGCAAGTGGGAGGGGGAA 84 9 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I M I I M I I M I M I I II I 1 I I I I I I 1 I I I I I 1 I II II I II I I M 1 I I I I I II II II 

Db 8 50 CT GGGCAC C GAC CTGGT GAGCAT C C CT CAT GGC CC CAAC GT CACT GT GC GT GC CAACATT 909 

Qy 481 GCT GC CAT CACT GAAT CAGACAAGTTCTT CAT CAACGGCT C CAACTG GGAAGGCATCCT G 54 0 

II M I I M I I I I I I I I I I I M M I I I I II I I I I I II II I I I I I I 1 I I I I I I I I I I 

Db 910 GCTGCCATCACTGAATCGGACAAGTTCTTCATCAATGGTTCCAACTGGGAGGGCATCCTA 969 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I II I II M I I M II I I I I I I M I M I I M I I I I I I I I I I I I II I I M I I I I I I I I 

Db 970 GGGCT GGCCT AT GCT GAGAT T GC CAGGCC C GAC GACT CT TT GGAGCC CTT CTTTGACT C C 1029 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I II I MINIMUM II M M M MM II M II II M I II II I I II M M I 
Db 1030 CTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTC 1089 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 720 

I I II I II M M I I II I I M I I II I I I lllllllllllllllillll MUM 

Db 1090 CCCCTCAACCAGACCGAGGCACTGGCCTCGGTGGGAGGGAGCATGATCATTGGTGGTATC 114 9 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

M M I 11 ( M I I II I I II M M I M M M I II M II I II I I I I I I M I II I I I I I II 
Db 1150 GACCACTCGCTATACACGGGCAGTCTCTGGTACACACCCATCCGGCGGGAGTGGTATTAT 1209 

Qy 781 GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT GGACAGGATCT GAAAAT GGACT GC AAGGAG 840 

II II I I 1 I I 1 I I II Mill M I II I M M (MM M I II M M I M II I M 

Db 1210 GAAGT GAT CAT T GT AC GT GT GGAAAT CAAT GGT C AAGAT C T CAAGAT G GAC T G CAAG GAG 1269 

Qy 841 TAC7VACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 900 

I I M II II I II II I M II I M I I II M I M I II II I II I II I I II I II II 1 I I I II I 

Db 1270 TACAACTACGACAAGAGCAT T GTGGACAGT GGGACCAC CAAC CTTC GCTT GCC CAAGAAA 1329 

Qy 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

II I I I I II I I I I I I II II M I M II I I II M I II II II II I I II II II I II I III 

Db 1330 GTATTTGAAGCTGCCGTCAAGTCCATCAAGGCAGCCTCCTCGACGGAGAAGTTCCCGGAT 1389 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

II If I I I II II I I II M I II I I II M I I II II II I II I I I I I I I I I I I I I II II I I 
Db 1390 GGCTTTTGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAACATT 144 9 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

M II II II II I II I II II II I I II I I M I M I II II I II I II II I II II II I I I 
Db 14 50 TTCCCAGTCATTTCACTTTACCTCATGGGTGAAGTCACCAATCAGTCCTTCCGCATCACC 1509 

Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 114 0 

I I I II I II I II II I I I II I II I II Mill M II M I II II II I I II II I II II II 
Db 1510 ATCCTTCCTCAGCAATACCTACGGCCGGTGGAGGACGTGGCCACGTCCCAAGACGACTGT 1569 

Qy 1141 TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAG 1200 

MINIM II II II II II II II II II II II II I II II N II II I II II II II II 
Db 1570 TACAAGT T C GC T GTCT CACAGTCAT C CAC GGGCAC T GTTAT GGGAGCC GT CAT CAT GGAA 1629 


Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

M Mill II I I I I I Mill I I I I I I I I I I M I II I I I M M II II M M I II I I 

Db 1630 GGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTTGC 1689 

Qy 12 61 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

I I I II 1 M II I M II I I I II M II II II I I I I M I I |] Mill II M II II 

Db 1690 CATGTGCACGATGAGTTCAGGACGGCGGCAGTGGAAGGTCCGTTTGTTACGGCAGACATG 1749 

Qy 1321 G AAGACT GT G G C T AC AAC AT T C C AC AG AC AG AT G AGT C AAC C C T CAT G AC CAT AG C C TAT 1380 

II I II II I I M M M I I I I I I I I I I II M I I I I I I M II I II II I I II I I I I M I II 

Db 1750 GAAGACT GT GGCT ACAACATTC C C CAGACAGAT GAGT CAACACTTATGAC CATAGC CTAT 1809 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

M I I M M II M M I M I I I M I II II M I II II I II II I M I I I M I I I II I I M I 

Db 1810 GTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAGTGG 1869 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I M I II I I I II II M i M I I I II I I I M I II M I I I M I I I M I M M M I I I 
Db 1870 CGCT GCCT GCGTT GCCT GCGCCACC AGCAC GAT GACT TT GCT GAT GACAT CT CCCT GCT C 1929 

Qy 1501 AAGTGAGGAGGCCCAT GGGCAGAAGAT AGAGAT T CCCCT GGACCACACCT CC GT GGTT CA 1560 

MM M I M M M M M II M Ml MM I M M II M M M M M M II I 

Db 1930 AAGTAAGGAGGCCCGTGGGCAGATGATGGAGACGCCCCTGGACCACATCTGGGTGGTTCC 1989 

Qy 1561 CT TT G GT CACA — AGT AGGAGACAC AGAT GGCACCT GT G GCCAGAGCACCT CAGGACCCT 1618 

M M M I M II Ml MM I (MM I M M M M M 1 I II M M M II M M I 

Db 1990 CT TT GGT CAC AT GAGTT GGAGCTAT GGATGGT ACCT GT GGCCAGAGCAC CT CAGGACC CT 2049 

Qy 1619 CCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCC 1678 

I I M M I I M M II M I I II III MM I II M M Ml M I 

Db 2050 CACCAAC CT GC CAAT GCTT CT GGCGT GACAGA- ACAGAGAAAT C AGG CAAGCT GGATT AC 2108 

Qy 1679 AGGGAC T GTAC CT GTAGGAAACAGAAAAGAGAAGAAAGAAGCAC T C T GC T GGC GGGAAT A 1738 

MM II M II I M I M II II I I I I I II M I M I M I I I II I I I M M 

Db 2109 AGGGCTTGCACCTGTAGGACACAGGAGAGGGAAGGAAGCAGCGTTCTGGTGGCAGGAATA 2168 

Qy 1739 CTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 17 98 

Ml I MM II I I II Ml I II M M I I II M I I II I II I I M M II II I 

Db 2169 T C CT TAGACAC CACAAACTT GAGT - T GGAAAT TT T GCT GC TT GAAG CT T CAGC CCT GAC C 2227 

Qy 1799 CTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAG 1858 

M II I I M I M II I II I I II I II II M I II I I M I II Mill 

Db 2228 CTCTGCCCCAGCGTCCTTTAGAGTCTCCAACCTAAAGTATTCTTTATGTC CTTCCAG 2284 

Qy 1859 AAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGA 1918 

II II M II M I II II MM MM Ml II I II I M II I I I II II II I M M I I I 

Db 2285 AAGTACTGGCGTCATACTCAGGCTACC-CGGCATGTGTCCCTGTGGTACCCTGGCAGAGA 2343 

Qy 1919 AGAGAC CAAGCTTGTT TCCCTGCTGGC CAAAGT CAGTAGGAGAGGAT GCACAGT TTGCTA 1978 

I I I II I I I II I II I II I II I I II II I I M II II I If I M II II I 
Db 2344 AAGGGCCAATC-TCATTCCCTGCTGGCCAAAGTCAGCAGAAGA — AAGTGAAGTTTGCCA 24 00 

Qy 1979 TTTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 2038 

I M M II II II I II II II I I I M II I M Ml I II I II I II I III MM 

Db 24 01 GTTGCTTTAGTGATAGGGACTGCAGACTCAAGCCT-ACACTGGTACAAAGACTGCGTCTT 24 59 


Qy 2039 GAATTAAAAAAAAA 2052 

II I I I I M II 
Db 2460 GAGATAAACAAGAA 2473 
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AK041285 3634 bp mRNA linear HTC 05-DEC-2002 

Mus musculus adult male aorta and vein cDNA, RIKEN full-length 
enriched library, clone :A530097B07 product :beta-site APP cleaving 
enzyme, full insert sequence. 
AK041285 

AK041285. 1 GI : 2 6334342 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 

Carninci,P. and Hayashizaki, Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 
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2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 
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Normalization and subtraction of cap-trapper-selected cDNAs to 
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3 

Shibata, K. 


10 (10), 1617-1630 (2000) 


Itoh,M. 
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RIKEN integrated sequence analysis (RISA) system--384-f ormat 
sequencing pipeline with 384 multicapillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 
11076861 
4 
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Quackenbush, J. , Schriml , L.M. , Staubli,F., Suzuki, R. , Tomita,M., 
Wagner, L., Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H., 
Baldarelli, R. , Barsh,G., Blake, J., Boffelli,D., Bojunga,N., 
Carninci,P., de Bonaldo,M. F. , Browns tein, M. J. , Bult,C, 


Fletcher, C, Fujita,M., Gariboldi, M. , Gustincich, S . , Hill,D., 
Hofmann,M., Hume, D. A., Kamiya,M., Lee, N . H. , Lyons, P., 
Marchionni, L. , Mashima,J., Mazzarelli, J. , Mombaerts , P . , Nordone,P., 
Ring,B., Ringwald,M., Rodriguez, I . , Sakamoto, N., Sasaki, H. , 
Sato,K., Schonbach, C. , Seya,T., Shibata,Y., Storch,K.F., Suzuki, H., 
Toyo-oka, K. , Wang,K.H., Weitz,C, Whittaker, C . , Wilming,L., 
Wynshaw-Boris, A. , Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
and Hayashizaki, Y. 

TITLE Functional annotation of a full-length mouse cDNA collection 

JOURNAL Nature 409 (6821), 685-690 (2001) 

MEDLINE 21085660 
PUBMED 11217851 
REFERENCE 5 

AUTHORS The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

TITLE Analysis of the mouse transcriptome based on functional annotation 

of 60,770 full-length cDNAs 

JOURNAL Nature 420, 563-573 (2002) 
REFERENCE 6 (bases 1 to 3634) 

AUTHORS Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P., 
Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume, W. , 
Hayashida, K. , Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
Hori,F., Imotani,K., Ishii,Y., Itoh,M. , Kagawa,I., Kasukawa,T., 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M. , 
Koya,S., Kurihara,C, Matsuyama, T . , Miyazaki,A., Murata,M., 
Nakamura,M. , Nishi,K., Nomura, K., Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh, H . , Sakai,C, Sakai,K., Sakazume, N . , 
Sano,H., Sasaki, D*, Shibata, K* , Shinagawa, A. , Shiraki,T., 
Sogabe,Y., Tagami,M. , Tagawa,A. , Takahashi, F. , Takaku-Akahira, S . , 
Takeda,Y. f Tanaka,T., Tomaru,A. , Toya,T., Yasunishi, A. , 
Muramatsu,M. and Hayashizaki, Y. 

TITLE Direct Submission 

JOURNAL Submitted (16-JUL-2001) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome-res @gsc . riken . go . jp, 
URL : http : //genome . gsc . riken . go . jp/ , Tel : 81-45-503-9222, 
Fax:81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : / /genome . gsc . riken . go . jp/ 
URL: http : //f an torn. gsc . riken. go . jp/ . 
FEATURES Location/Qualifiers 
source 1. .3634 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain-"C57BL/6J" 

/db__xref= ,l FANTOM_DB:A530097B07" 

/db_xref="taxon: 10090" 

/clone= M A530097B07" 

/sex- M male" 

/tissue_type="aorta and vein" 


/cl one lib— "RIKEN full— length, enriched mouse cDNA library" 
/dev_stage=" adult" 
CDS 450. .1955 

/no te= "unnamed protein product; beta-site APP cleaving 

enzyme (MGD | MGI : 134 6542, GB I NM__011792, evidence: BLASTN, 

98%, match=3874) 

putative" 

/codon_start=l 

/protein_id="BAC30889. 1" 

/db_xref="GI: 26334343" 

/ translation="MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLAGPPLGLRLP 
RETDEESEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVG 
AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANI 
AAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGA 
GFPLNQTEALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKM 
DCKEYNYDKS I VDS GTTNLRLPKKVFEAAVKS I KAAS STEKFPDGFWLGEQLVCWQAG 
TTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGT 
VMGAVIMEGFYWFDRARKRI GFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNI PQT 
DESTLMTIAYVMAAI CALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLLK" 

BASE COUNT 818 a 1022 c 981 g 813 t 

ORIGIN 

Query Match 75.1%; Score 1554; DB 11; Length 3634; 

Best Local Similarity 87.7%; Pred. No. 3.4e~192; 

Matches 1801; Conservative 0; Mismatches 240; Indels 13; Gaps 9; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

!!!!!!! Ill Mil I I II I I I I M I I I I I Mill Mill M II I M II I I II 
Db 450 ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGCTCGGGAATGCTGCCTGCCCAG 509 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

M Mill f I M M II I M II I I M II I I M II M 1 M 1 I M I I II II II I 

Db 510 GGAACCCATCTCGGCATCCGGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 569 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Ml I II II M I II M I I II II I II I II I M II II M I I 11 I II I MINIMI 
Db 570 CTGAGGCTGCCCCGGGAGACCGACGAGGAATCGGAGGAGCCTGGCCGGAGAGGCAGCTTT 62 9 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 240 

I I I M M I II I II I I I I II M I I M I I II I I II II II I I I I I I I I I M M I II II I 

Db 630 GTGGAGATGGTGGACAACCTGAGGGGAAAGTCCGGCCAGGGCTACTATGTGGAGATGACC 689 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

II II M I M M II I II II I II II II I II I II II II I II Mill II II M II I II I 

Db 690 GTAGGCAGCCCCCCACAGACGCTCAACATCCTGGTGGACACGGGCAGTAGTAACTTTGCA 74 9 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I M I I II II I II Mill II II II M II M II II I II II I II II II II I II II I I M 

Db 750 GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 809 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

M II II II II II M M II M II I II II II M II II M II I M II II II I I II II I 

Db 810 TAT C GAGAC CT C CGAAAGG GT GTGTAT GT GC C CTACACC CAGGG CAAGT GGGAGGGGGAA 869 


Qy 

Db 


421 
870 


CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 
II II II I M I II I II If I II M II I I II I II II II II I II M II II M I II I II II I I 
CTGGGCACCGACCTGGTGAGCATCCCTCATGGCCCCAACGTCACTGTGCGTGCCAACATT 929 


Qy 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M II I I I I II I I I I I MINIM 

Db 930 GCTGCCATCACTGAATCGGACAAGTTCTTCATCAATGGTTCCAACTGGGAGGGCATCCTA 98 9 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I M I I I II I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 990 GGGCTGGCCTATGCTGAGATTGCCAGGCCCGACGACTCTTTGGAGCCCTTCTTTGACTCC 104 9 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I 

Db 1050 CTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTC 1109 

Qy 661 C C C CT CAAC CAGT CT GAAGT GCTGGC CT CTGTC GGAGGGAGCAT GAT CATT GGAGGTAT C 720 

I i I I II I I I I I I I II I MINIM M N II N I I I II I II I I II I 1 1 II II I 

Db 1110 C C C CT CAAC CAGAC C GAGGCACT GGC CT CGGT G GGAGG GAGCAT GAT CATT G GT GGT AT C 1169 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 80 

M II I I I M I I Mill M I II M II II M I II II II I II N I M M I I I I II I II II 

Db 1170 GAC CACT CGCT AT ACAC GGGCAGT CTCTGGT ACACACC CAT CCGGCGGGAGT GGT AT TAT 1229 

Qy 7 81 GAGGTCAT CATTGT GCGGGTGGAGAT CAAT GGACAGGATCT GAAAAT GGACTGCAAGGAG 840 

I I M I N II II I N lllll llllllll M M I M N 1 I f II I 1 f I i I 1 M I 

Db 1230 GAAGT GAT CAT T GT AC GT GT GGAAAT CAAT GGT C AAGAT C T CAAGAT GGAC T GC AAGGAG 1289 

Qy 841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 900 

I M I II I I I II II I N II I M II M II M M I II M I M I M I M M M M M II II 

Db 1290 TACAACTACGACAAGAGCATTGTGGACAGTGGGACCACCAACCTTCGCTTGCCCAAGAAA 1349 

Qy 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

I I I II I ! I I II I I II I M II N I M I N N II II M N M M I N II I N N Ml 

Db 1350 GTATTTGAAGCTGCCGTCAAGTCCATCAAGGCAGCCTCCTCGACGGAGAAGTTCCCGGAT 14 09 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

M II I I I I I I II M 11 M II 11 II II II 11 I I I M I I 11 I I II II I M M I I II 

Db 1410 GGCTTTTGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAACATT 14 69 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

M II M II I M II I I I lllll II II I II I II Mill II I I I I II II II II II II 

Db 1470 T T C C CAGT CAT T T C AC TT T AC CT CAT G GGT GAAGT C AC CAAT CAGT C C T T C C GCAT C AC C 1529 

Qy 1081 AT CCTT CCGCAGCAATACCT GCGGCCAGTGGAAGATGT GGCCACGTC CCAAGACGACTGT 114 0 

llllllll II I I I II I! I I Mill lllll II II II I II II II II II M I II I II I 

Db 1530 AT CCTTCCT CAGCAATACCTAC GGC CGGT GGAGGACGT GGCCACGTC CCAAGACGACTGT 15 89 

Qy 1141 T ACAAGT TT GC CATC T CACAGT CAT CCAC GGGCACT GTT AT GG GAGC TGT T AT CATGGAG 1200 

llllllll II II I N M I II I II II I I II II II II I M M II II II I N II I N 

Db 1590 T ACAAGT T C GCT GT C T CACAGT CAT CCAC GGGCACT GT TAT G GGAGC CGT CAT CAT GGAA 1649 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

II INN II I II II lllll MINIM I I I I I N II II I I II I II I II II I I I I 

Db 1650 GGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTTGC 1709 

Qy 1261 CAT GT GCAC GAT GAGTTCAGGAC GGCAGC GGT GGAAGG CC CT T TT GT CAC CT TGGACAT G 1320 

II II II N I II M N II I I M II I II II I N I II II II lllll N II I I I I 

Db 1710 CAT GT GCAC GAT GAGTTCAG GAC GG CGGC AGT GGAAGGTC C GT TT GTTAC GGCAGACAT G 1769 


Qy 1321 GAAGACT GT GGC TACAACATTC CACAGACAGATGAGT CAAC C CT CAT GAC CATAGC C TAT 1380 

II I I I I II M I I I I I I II I I I I I II I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 1770 GAAGACT GT GGCTACAACATT C C CCAGACAGATGAGT CAACACT TAT GAC CATAGC CTAT 1829 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

1)111111 I I II t I I I I M I I I I I I II M M I I II II I I II I I M I I I I I I I I I I M 

Db 1830 GTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAGTGG 1889 

Qy 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I | I | | I | I | | | I I I I 
Db 1890 CGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTGCTC 194 9 

Qy 1501 AAGT GAGGAG GC C CAT GGGCAGAAGATAGAGAT T C C CCT GGACCACAC CTC C GT G GTT CA 1560 1 

I I I I I I I I I II I I I I I I I I I I III I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 1950 AAGTAAGGAGGCCCGTGGGCAGATGATGGAGACGCCCCTGGACCACATCTGGGTGGTTCC 2009 

Qy 1561 CTTT GGTCACA- - AGTAGGAGACACAGAT GGCACCT GTGGCCAGAGCACCTCAGGACCCT 1618 

II M M I I I I 1 Ml I 11 1 I Mill II II M M I I II M I M M M I I I M II 

Db 2010 CTTT GGT CACAT GAGTT GGAGCT AT GGAT GGT ACCT GT^GGC CAGAGCAC CT CAGGAC CCT 2069 

Qy 1619 CCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCC 1678 

I I II I I I II II I I I II I I II III II I I I II I I I I II I I I I 

Db 207 0 CAC CAAC CT GC CAAT GCTT CTGG CGT GACAGA- ACAGAGAAAT CAGGCAAGCT GGATTAC 2128 

Qy 1679 AGGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATA 1738 

MM M M M M II M M II I II II M M I I II M M M M II II M 

Db 2129 AGGGCTTGCACCTGTAGGACACAGGAGAGGGAAGGAAGCAGCGTTCTGGTGGCAGGAATA 2188 

Qy 1739 CTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 1798 

III I I II I I I II II III I I II I II II I II I II II I I I I I II II I I I I I 
Db 2189 T CCT TAGACAC CACAAAC T T GAGT - T GGAAAT TTT GCTGCTT GAAGCTT CAGC C C T GAC C 2247 

Qy 1799 CTTT GT CCACCATT CCT T T AAAT T CT CCAACCCAAAGT AT T CT T CT T TT CTT AGT T T CAG 1858 

M M M I M M M II I I M I M I I M M M I M M M Ml II I M 

Db 224 8 CTCTGCCCAGCA-TCCTTTAGAGTCTCCAACCTAAAGTATTCTTTATGTC CTTCCAG 2303 

Qy 1859 AAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGA 1918 

II I I I I I I II III II I II I MM III II I II II II I I I I II I II II I M I II I 

Db 2304 AAGT ACT G GC GT CAT ACT CAGGC TAC C - CGGCATGT GTC C CT GTGGTACC CT GGCAGAGA 2362 

Qy 1919 AGAGACCAAGCTT GTTT C C CT GCT GGC CAAAGT CAGTAG GAGAGGATGCACAGTTT GCTA 197 8 

I I I I II I I M I I M M I M I M II M I II I I I I I II I II II II I 

Db 2363 AAGGGCCAAT C - T CATT CCCT GCT GGCCAAAGT CAGCAGAAGA — AAGTGAAGTTTGCCA 2419 

Qy 1979 TTTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 2 038 

I I II I II I I II II I II I I I I I II I I II I III MM I II II I III I II I 
Db 2420 GTTGCTTTAGTGATAGGGACTGCAGACTCAAGCCT-ACACTGGTACAAAGACTGCGTCTT 2478 

Qy 2039 GAATTAAAAAAAAA 2052 

II II II II II 

Db 2479 GAGATAAACAAGAA 2492 


RESULT 3 
AK033112 

LOCUS AK033112 3877 bp mRNA linear HTC 05-DEC-2002 

DEFINITION Mus musculus 15 clays embryo male testis cDNA, RIKEN full-length 
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FEATURES Location/Qualifiers 
source 1. .3877 

,/organism="Mus musculus" 

/mol_type= "mRNA" 

/strain="C57BL/6J M 

/ db_x r e f = " FANTOM_DB : 8 0 3 0 4 3 1 G 04 " 

/db_xref="taxon: 10090" 

/clone="8 030431G04" 

/sex="male" 

/tissue_type="testis " 

/clone__lib="RIKEN full-length enriched mouse cDNA library" 
/dev_stage="15 days embryo" 
CDS 450. .1955 

/note="unnamed protein product; beta-site APP cleaving 

enzyme (MGD | MGI : 1346542, GB | NM_0117 92 , evidence: BLASTN, 

98%, match=3874) 

putative " 

/codon_start=l 

/protein_id="BAC28156. 1" 


/db_xref="GI: 26328835" 

/ translation="MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLAGPPLGLRLP 
RETDEESEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVG 
AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANI 
AAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGA 
GFPLNQTEALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKM 
DCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAG 
TT PWNI FPVI S L YLMGEVTNQS FRI T I L PQQYLRPVEDVAT SQDDC YKFAVSQS STGT 
VMGAVI MEG F YWF D RAR K R I G FAVS AC H VH D E FRT AAVE G P FVT ADME D C G YN I P QT 
DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLLK" 

BASE COUNT 877 a 1090 c 1030 g 880 t 

ORIGIN 

Query Match 75.1%; Score 1554; DB 11; Length 3877; 

Best Local Similarity 87.7%; Pred. No. 3.3e-192; 

Matches 1801; Conservative 0; Mismatches 24 0; Indels 13; Gaps 9; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I III I I I I I I I I I I I I I I II III I I I I I I I I M II I I I I I I I I I II 

Db 4 50 ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGCTCGGGAATGCTGCCTGCCCAG 509 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

II Mill I I I II I I M M I M M I I I I M M I I I II II I II I MINIM 

Db 510 GGAACCCATCTCGGCATCCGGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 569 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

II I I I I II I I II I I II I II II I I II II I I II I I I II II I I II I I II I I I II II 
Db 570 CT GAGGCT G CC C C GGGAGAC C GAC GAGGAAT C GGAGGAGC CT GGC CGGAGAGG CAGCT TT 629 

Qy 181 GT GGAGAT GGT GGAC AACCT GAGGGGC AAGT C GGGGCAGGGCT ACT ACGT GGAGATGACC 240 

I I II II II II II II I II II II I II M Mill II II I I II II II I II II II II I I II 

Db 630 GTGGAGAT GGT GGACAACCT GAGGGGAAAGT CCGGCCAGGGCT ACTAT GTGGAGAT GACC 689 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

M II I II II II II II II II I II II I II I M II II II II II II I II II II I II M I 

Db 690 GT AG G CAGC CC C C CACAGAC GCT CAACAT CCT GGT GGACAC GGGCAGTAGTAACTTT GCA 74 9 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Mill M II M M M M I I II II M II M M M M M M I M M II I M M M M M 

Db 750 GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 809 

Qy 361 TAC C G GGAC CT C C GGAAGG GT GT GTAT GT GC C CTACACC CAGGGCAAGT GGGAAGGGGAG 420 

M II II II II II I II II I II II I II I I II M II II II I I II II II II I II I II II 
Db 810 T AT CGAGAC CT C C GAAAGGGT GT GTAT GT GC C CTACACC CAGGGCAAGT GGGAG GGGGAA 869 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

II II II II II II II II I II I II I II II I II I II I M I II II I II I II II II I II II II 

Db 870 CTGGGCACCGACCTGGTGAGCATCCCTCATGGCCCCAACGTCACTGTGCGTGCCAACATT 929 

Qy 481 GCT G C CAT CACT GAAT CAGACAAGTTCT T CAT CAAC GGCT C CAACT GGGAAG GCAT C CTG 54 0 

M M M M M M II I M M M M II I M M M M M M I M II II M M M M M 

Db 930 GCT GC CAT CACT GAAT C GGACAAGT TCT T CAT CAAT GGTT C CAACTGGGAGGGCAT C CTA 98 9 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

M II II II II II M II I I II II II II II I II II II II M II II I II II I I II II I 
Db 990 GGGCTGGCCTATGCTGAGATTGCCAGGCCCGACGACTCTTTGGAGCCCTTCTTTGACTCC 104 9 


Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

II I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I II II I I I I I I I I I II I I I 
Db 1050 CTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTC 1109 

Qy 661 CCCC T CAAC CAGT CT GAAGT GCTGGCCTCTGT C GGAGGGAGC AT GAT CATT GGAG GTAT C 720 

M I I I I I M M I I II I MINIM M M M II M M I II II M I M MINI 

Db 1110 CCCCTCAACCAGACCGAGGCACTGGCCTCGGTGGGAGGGAGCATGATCATTGGTGGTATC 1169 

Qy 721 GACCACT CGCT GT ACACAGGCAGT CT CT GGT ATACAC C CAT C C GGCGGGAGT GGTAT TAT 780 

I I II I I I I I M I I I I I II I II I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I II II 

Db 1170 GAC CACT C GCTATACAC GG GCAGT CT C TGGTACACAC C CAT C C GGCGGGAGT G GTAT TAT 1229 

Qy 781 GAGGT CAT CATT GTGC GGGT GGAGAT CAATGGACAG GAT CT GAAAATGGACT GCAAGGAG 84 0 

II II I I II I I I I II Mill II I I I II I II II I I I II I II I II I II I II I II 

Db 1230 GAAGT GAT CATT GT AC GT GT GGAAAT C AAT GGT C AAGAT CT CAAGAT GGACT GCAAGGAG 1289 

Qy 841 TACAACTAT GACAAGAGCAT T GT GGACAGT GGCAC CAC CAACCTT C GT T T G C C CAAGAAA 900 

M M II M I I M I I M M M M M I M I M I M M M M I M M I I M M M M I M 
Db 1290 TACAACTAC GACAAGAGCATT GT GGACAGTGGGAC CACCAACCTTCGCTT GCCCAAGAAA 1349 

Qy 901 GT GT T T GAAG CT GCAGT CAAAT CCAT CAAG GCAGCCT C CT CCACGGAGAAGTT CCCTGAT 960 

II I II I I II I I I I I I I I I II I II I I II II I I I I I I II I II I II II II I I II I III 
Db 1350 GTAT TT GAAGCT GCC GT CAAGT C CAT CAAG GCAGC CT C CT CGAC GGAGAAGT T C C C GGAT 14 09 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

M II I I I II 1 II I II M II I I II I II I I I I M II I I I I I II I I I I I II I I I I I II I 
Db 1410 GGCTTTTGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAACATT 1469 

Qy 1021 TT C C CAGT CAT CT CACT CT AC CTAAT GGGT GAGGTT AC CAAC CAGT C CTT C C GCAT CACC 108 0 

II I I I II I I I I I II I I Mill II II II II II Mill II II It I I M 1 II I I M I 

Db 1470 T T C C CAGT CATT T CACT T TAC CT C AT GGGT GAAGT CAC CAAT CAGT C CT T C C GCAT CACC 152 9 

Qy 1081 AT C CTT C C GCAGCAATAC CT GC GGC CAGT GGAAGAT GT GGCCACGT C C CAAGAC GACT GT 114 0 

I I II II II I II M II II II Mill II I I I II II M II II II II II II II I II II I 

Db 1530 AT CCT T CCT CAGCAATAC CT AC GGCCGGT GGAGGACGTGGCCACGT CC CAAGAC GACT GT 1589 

Qy 1141 TACAAGT T TGCCAT CT CACAGT CAT C CAC GG GCACT GTTATGGGAGCT GTTAT CAT GGAG 1200 

MINIM II !!!!!!!!!!!! I !!! I !!!!!!! I ! t t i !!! t ! N llllllll 
Db 1590 TACAAGTT C GCT GT CT CACAGTCAT CCAC GG GCACT GTTAT GGGAGC C GT CAT CAT GGAA 164 9 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

II I I II I N II I II I I I II llllllll I I I I II II II II I I I I II I I I I I II I I 

Db 1650 GGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTTGC 17 09 

Qy 1261 CAT GT GCACGAT GAGT T CAG GAC GGCAGC GGT GGAAGGCCCT TT T GT CAC CT T GGACAT G 1320 

I I I I I II I M II II I I II I I II I II I II llllllll II I II II N I I II I I 

Db 1710 CATGTGCACGATGAGTTCAGGACGGCGGCAGTGGAAGGTCCGTTTGTTACGGCAGACATG 1769 

Qy 1321 GAAGACTGT GGCTACAACATT C CACAGACAGAT GAGT CAACCCT CAT GACCATAGCCTAT 1380 

IIIIIIMIIIIIIIIIMIMI I I | l l M | | I M l I I M M I I M I M I I I I I I I I 

Db 177 0 GAAG AC T GT G G C T AC AAC AT T C C C CAG AC AG AT GAGT CAAC AC T TAT GAC CAT AG C C TAT 1829 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

llllllll II I I II II I II I I I I II II I I I I I I II I II II I II I II I I I I II I N I I 
Db 1830 GTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAGTGG 18 8 9 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 


I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I 

Db 1890 CGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTGCTC 194 9 

Qy 1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 1560 

I I I I I I M I I I I I I I I II I I I III MM II I I I I II M I M II II I M I I 

Db 1950 AAGTAAGGAGG C C C GT GGGCAGAT GATGGAGACGC C C CT GGAC CACAT CT GGGT GGTT C C 2009 

Qy 1561 CTTTGGTCACA — AGTAGGAGACACAGAT G GCAC CT GT GGC CAGAGCAC CT CAGGAC C C T 1618 

M II II M I M III MM 1 Mill I M I I M M II I II M M I M II I M M 

Db 2 010 CTTTGGTCACATGAGTTGGAGCTATGGATGGTACCTGTGGCCAGAGCACCTCAGGACCCT 2069 

Qy 1619 CCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCC 167 8 

I I II II I I I I I I I II 1 I II I Mi MM I II I I II II I II I 

Db 2070 CACCAACCTGCCAATGCTTCTGGCGTGACAGA-ACAGAGAAATCAGGCAAGCTGGATTAC 2128 

Qy 1679 AGGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATA 173 8 

MM I I II II I I I II I I I II I I I II I I II I I I I MM I I I I II M M 

Db 2129 AGGGCTTGCACCTGTAGGACACAGGAGAGGGAAGGAAGCAGCGTTCTGGTGGCAGGAATA 218 8 

Qy 1739 CTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 1798 

III I MM MM M III I I II I I I II M I II M II II M I I I I I II I I 

Db 2189 T C CT TAGACAC C ACAAACT T GAGT - TGGAAATT TT GCT GC T T GAAGCT T CAGC C CT GAC C 224 7 

Qy 1799 CTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAG 1858 

II M III M II I I M I I I I I M M M M I I M I I I M III I I I I I 

Db 2248 CTCTGCCCAGCA-TCCTTTAGAGTCTCCAACCTAAAGTATTCTTTATGTC CTTCCAG 2303 

Qy 1859 AAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGA 1918 

II II II I I I I III II I II I MM III I I II M M I I I I I II II I I I II I II I I 

Db 2304 AAGTACTGGCGTCATACTCAGGCTACC-CGGCATGTGTCCCTGTGGTACCCTGGCAGAGA 2362 

Qy 1919 AGAGACCAAGCT T GT TT CCCT GCT GGC CAAAGT CAGT AGGAGAGGAT GCACAGTTT GCT A 197 8 

I I I II I I I II I I M I II M II I M I II I I I I II I II I II I II I I 
Db 2363 AAGGGC CAAT C - T CAT TCCCTGCT GGC CAAAGT CAG CAGAAGA — AAGTGAAGTTT GC CA 2419 

Qy 197 9 TTTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 203 8 

M M M I M M M M II M I I M M M I Ml MM M M M Ml I I II 

Db 242 0 GTTGCTTTAGTGATAGGGACTGCAGACTCAAGCCT-ACACTGGTACAAAGACTGCGTCTT 24 7 8 

Qy 2039 GAATTAAAAAAAAA 2052 

M I I II I I II 

Db 2479 GAGATAAACAAGAA 2492 
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AK049626 4046 bp mRNA linear HTC 05-DEC-2002 

Mus musculus 12 days embryo spinal cord cDNA, RIKEN full-length 
enriched library, clone : C530008K17 product : beta-site APP cleaving 
enzyme, full insert sequence. 
AK049626 

AK04 962 6. 1 GI : 2 634 0361 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
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TITLE Direct Submission 

JOURNAL Submitted ( 16- JUL-2001 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
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FEATURES Location/Qualifiers 
source 1. .4046 

/organism="Mus mus cuius" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/db_xref="FANTOM_DB:C530008K17" 

/db_xref="taxon: 10090" 

/clone="C530008K17" 

/tissue_type="spinal cord" 

/ clone_lib=" RIKEN full-length enriched mouse cDNA library" 
/dev_stage-"12 days embryo" 
CDS 433. .1938 

/note="unnamed protein product; beta-site APP cleaving 

enzyme (MGD | MGI : 1346542 , GB | NM__011792, evidence: BLASTN, 

98%, match=3874) 

putative " 

/ codon__start=l 

/protein_id="BAC33844 .1" 

/db_xref="GI: 26340362" 

/translation="MAPALHWLLLWGSGMLPAQGTHLGIRLPLRSGLAGPPLGLRLP 
RETDEESEEPGRRGSFVEMVDNLRGKSGQGYYVEMTIGSPPQTLNILVDTGSSNFAVG 
AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANI 
AAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGA 
GFPLNQTEALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKM 
DCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIK7VVSSTEKFPDGFWLGEQLVCWQAG 
TTPWNI FPVI SLYLMGE VTNQS FRITI LPQQYLRPVEDVATSQDDCYKFAVSQS STGT 
VMGAVI ME G F YWFD RARKRI G FAVS ACHVH D E FRT AAVE G P FVT ADME DC G YN I P QT 
DESTLMTIAYVMAAICT^FMLPLCLMVCQWRCLRCLRHQHDDFADDISLLK" 


BASE COUNT 951 a 1102 c 1056 g 937 t 

ORIGIN 

Query Match 75.0%; Score 1552.4; DB 11; Length 4046; 

Best Local Similarity 87.6%; Pred. No. 5.2e-192; 

Matches 1800; Conservative 0; Mismatches 241; Indels 13; Gaps 9; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I M I I I I I I I I I I I I I I I I III I I I I I | | | | | I I I I I I I I I I I I I 
Db 433 ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGCTCGGGAATGCTGCCTGCCCAG 4 92 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I | | || | | | | | | | | | 
Db 4 93 GGAACCCATCTCGGCATCCGGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 552 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I M I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 553 CTGAGGCTGCCCCGGGAGACCGACGAGGAATCGGAGGAGCCTGGCCGGAGAGGCAGCTTT 612 

Qy 181 GT GGAGAT GGT GGACAAC CT GAGGGG CAAGT C GG GGC AG GGCT ACT AC GT GGAGAT GAC C 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I | 
Db 613 GT GGAGAT GGT GGACAAC CT GAG GGGAAAGT C C GGC CAGGG CT ACT AT GTGGAGAT GAC C 672 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I 
Db 673 ATAGGC AGC C C C C C AC AGAC GCT CAACAT C C T GGT GGACAC GGGC AGT AGTAAC T TT G C A 7 32 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 3 60 

I I I I I I I I I I I I I I I II I M I I I II I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 733 GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 792 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

M M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I 
Db 793 TATCGAGACCTCCGAAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAGGGGGAA 852 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 853 CT GGGCAC C GAC C T G GT GAGCATCC CT CAT GGC C C CAAC GT C ACT GT GC GT GC CAACAT T 912 

Qy 4 81 GCT GC CAT C AC T GAAT CAGACAAGT T CT T CAT CAAC GGCT C CAACT GG GAAGGC AT C CT G 540 

I I I I I I I I M I I I II I I I I I I I I I I I I I I I I | || || I I I I I I I I I I I I I I I I I I I 
Db 913 GCTGCCATCACTGAATCGGACAAGTTCTTCATCAATGGTTCCAACTGGGAGGGCATCCTA 972 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I | | | | | | | | M 
Db 973 GGGCTGGCCTATGCTGAGATTGCCAGGCCCGACGACTCTTTGGAGCCCTTCTTTGACTCC 1032 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Mill MINIMUM I I I | | | | | | | | I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1033 CTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTC 1092 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 720 

I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1093 CCCCTCAACCAGACCGAGGCACTGGCCTCGGTGGGAGGGAGCATGATCATTGGTGGTATC 1152 


Qy 721 GAC C ACT C GCT GT AC AC AGGC AGT CT C T GGT AT AC AC C CAT CC GG C GG GAGT GGT AT TAT 780 

I I I I M I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I | | | 


1153 GACCACTCGCTATACACGGGCAGTCTCTGGTACACACCCATCCGGCGGGAGTGGTATTAT 1212 

781 GAGGT CAT CAT T GT GC GG GT GGAGAT CAAT GGAC AGGATCT GAAAAT GGAC T GCAAG GAG 840 
II N I I I I I I I I || | | | | | MINIM I I | | M I II I I I I I I II I I I I I I I 
1213 GAAGT GAT CAT T GTAC GT GT GGAAAT CAAT GGT CAAGATCT CAAGAT GGAC T GCAAG GAG 1272 

841 T ACAACT AT GACAAGAGCATT GT GGACAGT G G CAC CACCAAC CT T C GTT T G C C CAAGAAA 900 

I I I I I I I I I I I I I I I I I M I I I I II I I I I I I I I II I I I I I I I I | | I I I I I I I I I I M 
1273 T ACAAC T AC GACAAGAGCAT T GT GGACAGT G GGAC CAC CAAC CT T CGCTTGCC CAAGAAA 1332 

901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 
M I I I I I I I I I I I | | | | | | | | | | | | | | | M | | | | | | | | | | | | | M | | | | | | | M I 
1333 GT ATT T GAAGCT GC C GT CAAGT C CAT CAAGGCAGC CT C CT C GAC GGAGAAGT T C C C G GAT 1392 

961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

II II I I I I I M I I I I I I I I I | | | || | | | | M | | | | | | | | | | M I I | | I I I I I II I I 
1393 GGCTTTTGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAACATT 1452 

1021 T T C C CAGT CAT CT CACT CT AC CTAATGGGT GAGGT T AC CAAC CAGT C CT T C C GCAT CAC C 1080 

I I I I I I I I I I I | | | | | | | | | | | | M I I I I II I I I I I I I I I I I I I I I I I I I I I | | 
14 53 T T C C CAGT CAT T T CACT TT AC CT CAT GGGT GAAGT CAC CAAT CAGT C CT T C C GC AT CAC C 1512 

1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 114 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I | I I II M I I I I II I I I I I I I I I I I I I I I I 
1513 ATCCTTCCTCAGCAATACCTACGGCCGGTGGAGGACGTGGCCACGTCCCAAGACGACTGT 1572 

1141 T ACAAGT T T GC CAT CT CACAGT CAT C CAC GGG C ACT GT TAT GGGAG CT GT TAT CAT GGAG 1200 

M I I I- I I I II I I I I I I I I II I | | | | | | | | | | | | | | | | | | | | | | | M I I I I I I I I 
1573 T ACAAGT T C GC T GT CT CACAGT CAT C CAC GGG CACT GT T AT G GGAGC C GT CAT CAT GGAA 1632 

12 01 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

N HIM II I II II I I I I I I I I I I I I I I | | | | | | | | | || | | | | | | | | | | | | | | | 
1633 GGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTTGC 1692 

12 61 CAT GT GC AC GAT GAGTTCAGGAC GGCAG C GGT GGAAGGC C CT TT T GT C AC CT T GGAC AT G 1320 

I I I I I I I I M M I II I I I I M I I I II II I I I I I I I I II I I I I I || II II I | 

1693 CAT GT GCAC GAT GAGTT CAGGAC GGC GGCAGT GGAAGGT C C GT T T GT T AC GGCAGAC AT G 1752 

1321 GAAGACT GT GGCT ACAACATT C CACAGAC AGAT GAGT CAAC C CT CAT GAC C ATAGC CT AT 138 0 

I I I M I II I I I I I I I I I I I I | | | | | | | | | || | | M || I II I I II I I I 

1753 GAAGACT GT GGCTACAACATT CCCCAGACAGAT GAGTCAACACTTAT GACCATAGCCT AT 1812 

13 81 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I I I I I I I I I I I I I I I II I I I I M II Ml II || 

1813 GT CAT GG C GGC CAT CT G C GC C CT CT T CAT GTTGC CACT CTGCCT CAT GGTATGT CAGT GG 1872 

14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I | 

1873 CGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTGCTC 1932 

1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 1560 

I I I I I I I I I I M I I I I I I I I I I I I | | | | I I I I | | | | | | | | | || | | | | | | | 
1933 AAGTAAG GAGGCC C GT GGGCAGAT GAT GGAGAC G C C C CT GGACCACAT CT GGGT GGTT C C 1992 

1561 CTTT GGT CACA- - AGT AGGAGAC ACAGAT GG CAC C T GT GG C CAGAGCAC CT C AGGACC CT 1618 

I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I 
1993 CTTT GGT CAC AT GAGTT GGAGCT AT GGATGGTACCT GTGGCCAGAGCACCT CAGGAC C CT 2052 


QY 1619 CCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCC 1678 

I I I I I I I I I I I I I I I I I I I I I | | M II I I I I I I I II I I I I 
Db 2 053 CACCAACCTGCCAATGCTTCTGGCGTGACAGA-ACAGAGAAATCAGGCAAGCTGGATTAC 2111 

Qy 167 9 AGGGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CTGCT GGCGGGAAT A 1738 

MM II I I I I I I I I I I I I I I I I I I I I I I | | | | | I | | | | | | | | | | | | | 
Db 2112 AGGGCTTGCACCTGTAGGACACAGGAGAGGGAAGGAAGCAGCGTTCTGGTGGCAGGAATA 2171 

Qy 1739 CTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 17 98 

Ml I I I I I MM II III I M I I II II I I I | || | | | | | M I I II I I I I I 
Db 2172 TCCTTAGACACCACAAACTTGAGT-TGG7VAATTTTGCTGCTTGAAGCTTCAGCCCTGACC 2230 

Qy 1799 CTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAG 1858 

M M II I I I II II I II I | || || | || | | | || | | | || m Ml II I I I 

Db 2231 CTCTGCCCAGCA-TCCTTTAGAGTCTCCAACCTAAAGTATTCTTTATGTC CTTCCAG 2286 

Qy 1859 AAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGA 1918 

N I I M II I I M I II I I II II I I III I II II || | | || | || || || | || || | | || 
Db 22 87 AAGTACTGGCGTCATACTCAGGCTACC-CGGCATGTGTCCCTGTGGTACCCTGGCAGAGA 2345 

Qy 1919 AGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTA 197 8 

I I I M I I I I II I II I II I I I I I | I II III | | II II I II I 

Db 234 6 AAGGGCCAATC-TCATTCCCTGCTGGCCAAAGTCAGCAGAAGA— AAGTGAAGTTTGCCA 24 02 

Qy 197 9 TT T GCT T T AGAGACAGGGACTGT AT AAACAAGC CTAAC ATT GGT G CAAAGAT T GC CT CTT 2 038 

M I M II II II I I I I I I II I I I I I II I I III II I I I I I | I I III I I M 
Db 24 03 GTTGCTTTAGTGATAGGGACTGCAGACTCAAGCCT-ACACTGGTACAAAGACTGCGTCTT 24 61 

Qy 2039 GAATTAAAAAAAAA 2052 

II I I I I I I I I 

Db 24 62 GAGATAAACAAGAA 2475 
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AK046175 4101 bp mRNA linear HTC 05-DEC-2002 

Mus musculus adult male corpora quadrigemina cDNA, RIKEN 
full-length enriched library, clone : B230346M13 product :beta-site 
APP cleaving enzyme, full insert sequence. 
AK046175 

AK04 6175. 1 GI: 2 63378 68 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 
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Analysis of the mouse transcriptome based on functional annotation 
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6 (bases 1 to 4101) 

Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P., 
Fukuda,S., Furuno,M., Hanagaki,T., Hara, A., Hashizume, W. , 
Hayashida, K. , Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T., 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M., 
Koya,S., Kurihara,C, Matsuyama, T . , Miyazaki,A., Murata,M., 
Nakamura,M., Nishi,K., Nomura, K. , Numazaki,R., Ohno,M. , Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh, H., Sakai,C, Sakai,K., Sakazume,N., 
Sano,H., Sasaki, D., Shibata,K., Shinagawa, A. , Shiraki,T., 


Sogabe,Y., Tagami,M., Tagawa,A., Takahashi, F. , Takaku-Akahira, S . , 
Takeda,Y., Tanaka,T., Tomaru, A. , Toya,T., Yasunishi , A. , 
Muramatsu, M. and Hayashizaki, Y. 
TITLE Direct Submission 

JOURNAL Submitted (16- JUL-2001) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC), 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan ( E-mail : genome- res @gsc . riken . go . jp, 
URL: http: //genome. gsc. riken . go . jp/, Tel : 81-45-503-9222, 
Fax: 81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
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FEATURES Location/Qualifiers 
source 1. .4101 

/organism="Mus musculus" 

/mol_type= ,, mRNA" 

/strain="C57BL/6J" 

/ db_xref ="FANTOM_DB: B230346M13" 

/db_xref="taxon: 10090" 

/clone="B2 30346M13" 

/sex="male" 

/tissue__type=" corpora quadrigemina" 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/ dev_stage="adult" 
CDS 447. .1952 

/note="unnamed protein product; beta-site APP cleaving 

enzyme (MGD | MGI : 1346542 , GB | NM_0117 92 , evidence: BLASTN, 

98%, match=3874) 

putative" 

/codon__start=l 

/protein_id= n BAC32620 . 1" 

/db_xref="GI:26337869" 

/ trans lation="MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLAGPPLGLRLP 
RETDEES EEPGRRGS FVEMVDNLRGKSGQGYYVEMTVGS PPQTLNI LVDTGS SNFAVG 
AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANI 
AAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHI PNI FSLQLCGA 
GFPLNQTEALASVGGSMIIGGIDHSLYTGRLWYTPIRREWYYEVIIVRVEINGQDLKM 
DCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAG 
TTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGT 
VMGAVIMEGFYWFDRARKRI GFAVSACHVHDEFRTAAVEGP FVTADMEDCGYNI PQT 
DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLLK" 

BASE COUNT 964 a 1118 c 1066 g 953 t 

ORIGIN 


Query Match 75.0%; Score 1552.4; DB 11; Length 4101; 

Best Local Similarity 87.6%; Pred. No. 5.1e-192; 

Matches 1800; Conservative 0; Mismatches 241; Indels 13; Gaps 9; 


Qy 


1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 
I I I I I I I III I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I M 


Db 4 47 ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGCTCGGGAATGCTGCCTGCCCAG 506 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 1111(111 

Db 507 GGAACCCATCTCGGCATCCGGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 566 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

III I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 567 CT GAG G CT GC CC C GGGAGAC C GAC GAGGAAT C GGAGGAG C CT GGC C GGAGAGGC AG CT TT 62 6 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

I II I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I I 

Db 627 GT GGAGAT G GTGGACAAC CT GAGG GGAAAGT C C GGC CAGGGCT ACT AT GT GGAGAT GAC C 68 6 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I i I I I I 

Db 687 GTAGGCAGCCCCCCACAGACGCTCAACATCCTGGTGGACACGGGCAGTAGTAACTTTGCA 74 6 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 747 GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 8 06 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

II II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 8 07 TATCGAGACCTCCGAAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAGGGGGAA 866 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 8 67 CTGGGCACCGACCTGGTGAGCATCCCTCATGGCCCCAACGTCACTGTGCGTGCCAACATT 926 

Qy 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 927 GCTGCCATCACTGAATCGGACAAGTTCTTCATCAATGGTTCCAACTGGGAGGGCATCCTA 986 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 987 GGGCTGGCCTATGCTGAGATTGCCAGGCCCGACGACTCTTTGGAGCCCTTCTTTGACTCC 104 6 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I MINIM II I I I I I II I I I I I I I ■ 

Db 104 7 CTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTC 1106 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I I I I I I I I I I I I I II I M II II I I M II I i II I IS II M I | | | | | | 

Db 1107 CC CCT CAACCAGACCGAGGCACTGGCCT C GGT GGGAGGGAGCAT GAT CATT GGTGGTAT C 1166 

Qy 721 GAC CAC T C GCT GT ACAC AGGCAGT CTCT GGT AT AC AC C CAT CC G G C GG GAGT GGT AT TAT 780 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 1167 GACCACTCGCTATACACGGGCAGACTCTGGTACACACCCATCCGGCGGGAGTGGTATTAT 1226 

Qy 781 GAGGT CAT CATTGTGCGGGT GGAGATCAATGGACAGGAT CTGAAAAT GGACTGCAAGGAG 84 0 

II II I I I I I I I I II I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I 

Db 1227 GAAGT GAT CAT T GT AC GT GT GGAAAT CAAT GGT CAAGAT CT CAAGAT GGACT GCAAGGAG 12 8 6 

Qy 841 T ACAACT AT GACAAGAGCAT T GT GGAC AGT GGCAC CAC CAACC T TCGTTTGCC CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M II II M II I I I I I I I I I | I I I 

Db 1287 T ACAACT AC GACAAGAGCATT GT GGAC AGT GGGAC CAC CAACC T TCGCTTGCC CAAGAAA 1346 


Qy 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

II I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I II II I II I I I I I I I I I III 
Db 1347 GT ATTT GAAG CT GC C GT CAAGT C CAT CAAGGCAGC CT C CT CGAC G GAGAAGTT C C C GGAT 1406 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

II M MINIM II II I II II I I I I II I I I II I II I I I M I I I I I II II II I II I I 
Db 14 07 GGCTTTTGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAACATT 14 66 

Qy 1021 T T C CC AGT CAT CT C ACT CT AC CT AAT GGGT GAG GT T AC CAAC CAGT C CTT CCGC AT CAC C 1080 

M I I II I I I II I I II I Mill II I I I I I I II II I I I I I I II II II II II II I II 
Db 14 67 TT C CC AGT CAT T T CACT T T AC CT CAT GGGT GAAGT CAC CAAT CAGT C CT T CCGCAT CAC C 1526 

Qy 1081 AT C CT T C C GC AGCAAT AC CT GC GGC CAGT G GAAGAT GT GGC C AC GT C C CAAGACGACT GT 1140 

I M M II I I II II I II II I Mill II I I I II II II I II I II I II II II II II II I 

Db 1527 AT CCT T C CT C AGCAAT AC CT AC GGC CGGT GGAGGAC GT GGC CAC GT C C CAAGAC GACT GT 1586 

Qy 1141 T ACAAGT T T GC CAT CT CAC AGT CAT CCAC G GGC ACT GTT AT GGGAGC T GTT AT CAT GGAG 1200 

M II II II II I II II II I II I II II II II II II II II I II I I I I II II II I I II 
Db 1587 T ACAAGT T C G CT GTCT CAC AGT CAT CCAC GGGC ACT GTT AT GGGAG C CGT CAT CAT GGAA 1646 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

M II II I II Mill II I II II II II II II II II II I II I M II M I II II I II I 
Db 1647 GGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTTGC 1706 

Qy 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 132 0 

II II I II M I I II II M II I I II II I M II II I I II II II I II II II M M 

Db 17 07 CAT GT GCAC GAT GAGT T CAGGAC GGCG GC AGT GGAAGGT C C GT T T GT T ACGG C AGACAT G 1766 

Qy 1321 GAAGACT GT G GCTACAACAT T C CACAGAC AGAT GAGT CAAC CCT CAT GACCAT AGC CT AT 1380 

II II II I II II II II I II I I II I I M II II I M II II II I II II II II II II II II I 
Db 1767 GAAGACT GT GGCTACAACATT CCCCAGACAGAT GAGTCAACACTT AT GACCAT AGCCTAT 1826 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

I II II II I II II I II II II M M II I I M I II II II II I II I I M I M II II I II II 

Db 1827 GTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAGTGG 18 86 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

M II M II II I II II II II II II II I I II II II II II II I II II II II II I I I I I 
Db 18 87 CGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTGCTC 194 6 

Qy 1501 AAGT GAGGAG GC CC AT GGG C AGAAGAT AGAGAT T C C CCT GGACCACAC CT C C GT GGT T C A 1560 

II II M II II I II II II I I II III I II I II II II II M I I I II I I II II I 

Db 1947 AAGT AAGGAGGC CC GT GGGC AGAT GAT GGAGAC GC CCCT GGAC CAC AT CT GGGT GGT T C C 2006 

Qy 1561 CTT T GGT CAC A — AGT AGGAGACACAGAT G G CAC CT GT GGC C AGAGC ACCT CAGGAC CCT 1618 

I II II II II II III MM I Mill II I I II II I I I II II II II I II I I II II 
Db 2 007 CT T T GGT CAC AT GAGT T GGAGCT AT G GAT GGT AC CT GT GGC C AGAGCAC CT CAG GAC CCT 2066 

Qy 1619 CCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCC 1678 

I M I M I I II II I I I I I M I III II II I II I I II II I II I 
Db 2 067 CAC CAAC CT GCCAAT GCTTCTGGCGT GACAGA- AC AGAGAAAT C AGGCAAGCT GGAT T AC 2125 

Qy 167 9 AGGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATA 1738 

MM I I II II II II II II II I II | | || | | | || I II II I II 

Db 212 6 AGGGCTTGCACCTGTAGGACACAGGAGAGGGAAGGAAGCAGCGTTCTGGTGGCAGGAATA 2185 


Qy 

Db 


1739 
2186 


CTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 17 98 

Ml I I I I I I I I I II Ml I I II I M M I I I M I M I II II I I M I I II I 

TCCTTAGACACCACAAACTTGAGT-TGGAAATTTTGCTGCTTGAAGCTTCAGCCCTGACC 2244 


Qy 1799 CTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAG 18 58 

II I I I I I I I M I I II I I M I M I II I I II I I I M II I III I I I I I 

Db 2245 CTCTGCCCAGCA-TCCTTTAGAGTCTCCT^CCTAAAGTATTCTTTATGTC CTTCCAG 2300 

Qy 1859 AAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGA 1918 

M I I I I M M I I I I I II II I I I I III I I I I M II I I I I I I I I I I I I II || II I 
Db 2301 AAGTACTGGCGTCATACTCAGGCTACC-CGGCATGTGTCCCTGTGGTACCCTGGCAGAGA 2359 

Qy 1919 AGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTA 197 8 

I I I I II I I I II I I I I I I I II I II I I I II I I I M I II II I I II I I 

Db 2360 AAG GGC CAAT C - T CAT T CC C T GCT GGC CAAAGT C AGCAGAAGA — AAGT GAAGT T T G C C A 2416 

Qy 1979 TTTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 2038 

I I I M I I I I II I II I I I M I I I I I I I I I II I I I II I II II I I IE I II I 
Db 2417 GTTGCTTTAGTGATAGGGACTGCAGACTCAAGCCT-ACACTGGTACAAAGACTGCGTCTT 2475 

Qy 2039 GAATTAAAAAAAAA 2052 

II II II I I I I 

Db 2476 GAGATAAACAAGAA 2489 
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Location/Qualifiers 

1. .3805 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6J" 
/db_xref= n FANTOM_DB:C230026O08 n 
/db_xref="taxon: 10090" 
/clone= ,, C23002 6O08" 
/tissue_type= " cerebellum" 

/ clone__lib=" RIKEN full-length enriched mouse cDNA library" 
/dev_stage="0 day neonate" 
448. .1955 

/note="beta-site APP cleaving enzyme (MGD | MGI : 134 6542 , 
GB|NM_011792, evidence: BLASTN, 98%, match^3874) 
putative" 
862 a 1068 c 1021 g 854 t 


Query Match 73.9%; Score 1530.4; DB 11; 

Best Local Similarity 87.5%; Pred. No. 3.7e-189; 
Matches 1800; Conservative 0; Mismatches 241; Indels 


Length 3805; 

15; Gaps 


11; 


Qy 


Db 


448 


ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 
I I I I I I I III I I I I I I I I I I I I I I I I III I I I I I Mill I I I I I I I I I I I I I 
ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGCTCGGGAATGCTGCCTGCCCAG 507 


Qy 

Db 

Qy 

Db 

Qy 

Db 


61 


508 


121 


568 


181 


628 


GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

II I I I I I I I I I I I I I I I II I I I III I I I I I I I I I I I I I II I I I I I I I I I 
GGAACCCATCTCGGCATCCGGCTGCACCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 567 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

III I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGAGGCTGCCCCGGGAGACCGACGAGGAATCGGAGGAGCCTGGCCGGAGAGGCAGCTTT 627 

GT GGAGAT GGT G GACAAC CT GAGGGGCAAGT C G GGGCAGG GCT ACT AC GT GGAGAT GAC C 24 0 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

GT GGAGAT GGT G GACAAC CT GAG GGGAAAGT C C GGCCAGGGCT ACT AT GT GGAGAT GAC C 687 


Qy 

Db 

Qy 

Db 


241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 
II I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I 

688 GT AGGCAGC C C C C CACAGAC GCT CAAC AT C CT G GT GGAC AC G GGC AGT AGTAACTT T GC A 74 7 

301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I 
748 GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 807 


Qy 


361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCA-GGGCAAGTGGGAAGGGGA 419 


II II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 8 08 TATCGAGACCTCCGAAAGGGTGTGTATGTGCCCTACACCCAGGGGCAAGTGGGAGGGGGA 867 

Qy 420 GCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACAT 479 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 8 68 ACTGGGCACCGACCTGGTGAGCATCCCTCATGGCCCCAACGTCACTGTGCGTGCCAACAT 927 

Qy 480 T GCT GCC AT C ACT GAAT C AGACAAGT TC T T CAT C AAC GGCTC CAACT GGGAAGG C AT C CT 539 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I 1 I I I I I I I I I I 

Db 92 8 TGCTGCCATCACTGAATCGGACAAGTTCTTCATCAATGGTTCCAACTGGGAGGGCATCCT 987 

Qy 540 GGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTC 599 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 988 AGG GCT GGC CT AT GCT GAGATT G C C AGGC C C GACGAC T CT TT G GAGC CCT T C T T T GACT C 104 7 

Qy 600 TCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTT 659 

Mill MINIMUM I I I I II I I I I 1 I I I I I I I I I II I I I I I I I I I I I I I 

Db 1048 CCTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTT 1107 

Qy 660 C C C C CT CAAC CAGT CT GAAGT GCTGGCCT CT GT C GGAGGGAGC AT GAT CAT T GGAGGT AT 719 

I I II I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I Mill 

Db 1108 CCCCCTCAACCAGACCGAGGCACTGGCCTCGGTGGGAGGGAGCATGATCATTGGTGGTAT 1167 

Qy 720 CGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTA 77 9 

II M M II I I M Mill I I I I I M I I M I II II I I I M I I I I I M II I I I I II M I I 

Db 1168 CGACCACTCGCTATACACGGGCAGTCTCTGGTACACACCCATCCGGCGGGAGTGGTATTA 1227 

Qy 780 T GAGGT CAT CATT GT GC GGGT GGAGAT CAAT GGAC AG GAT CT GAAAAT GGACT GCAAGGA 839 

III II M I I I I I I II I I I I I I II II I I I M I I II I II I I I I I I I I I I I I I I 

Db 1228 T GAAGT GAT CATT GT AC GTGT GGAAAT CAAT GGT CAAGAT CT CAAGAT GGACT GCAAGGA 12 87 

Qy 84 0 GT ACAACT AT GACAAGAGC ATT GT GGACAGT GGCAC C AC CAACCTT CGT T T G C C CAAGAA 899 

I M II M I I I I I I I I II M I I I I M M II II I I II I M II M I I I I I I I II M I I I I 

Db 1288 GT ACAACT AC GACAAGAGC ATT GT GGACAGT GGGAC C AC CAAC CT T CGCT T GC C CAAGAA 1347 

Qy 900 AGT GTT T GAAGCT GC AGT CAAAT C CAT CAAG GCAGC CT C CT C C AC GGAGAAGT T C C CT GA 959 

Ml I M II I II II I I II I I I I I I II I II I I II II II I II M I I I II I I II I II II 
Db 134 8 AGTATTTGAAGCTGCCGTCAAGTCCATCAAGGCAGCCTCCTCGACGGAGAAGTTCCCGGA 14 07 

Qy 960 TGG-TTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACA 1018 

III II I I I II I I I M I II II I I II II II I II I I I I II II I I I I I I II I I M I I I I 
Db 1408 TGGCTTTTGGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAACA 1467 

Qy 1019 T T T T CC C AGT C AT CT CACT CT AC CT AAT GGGT GAGGTT AC CAAC CAGT C CT T C C GC AT CA 107 8 

I I II I I I M I I I I I I I I I I II I I I I I I I II I II I I I I I I I I I I I II I II I I I I I 

Db 1468 TTTTCC CAGT CATTTCACTTTACCT CAT GGGT GAAGT CACCAATCAGTCCTTCCGCATCA 1527 

Qy 1079 C CAT CC T T C C GC AGCAAT AC CT G C G GC CAGT GGAAGAT GT GGC C AC GT C C CAAGAC GACT 1138 

I I II I I I M I I II I I I I I I I I I I I I I I II II II I II II I I I I I I I II I I II M I I 
Db 1528 C CAT CC T T C CT C AGCAAT AC CT AC GGC C G GT GGAG GAC GT GGC C AC GT C C CAAGAC GACT 1587 

Qy 1139 GTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGG 1198 

I I I I II I I II II I I I II I I I I II I I II I I I I I I I II I I I I I I I I I I II I I II I I I 
Db 1588 GTTACAAGTTCGCTGTCTCACAGTCATCCACGGGCACTGTTATGGGAGCCGTCATCATGG 1647 

Qy 1199 AGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTT 1258 

I II I I I I I II Mill I I I I I I I II I I I I II II II I M I I I I M I I I I I II I M 


Db 


164 8 AAGGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTT 17 07 


Qy 1259 GCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACA 1318 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I II I I II I I I I 

Db 1708 GCCATGTGCACGATGAGTTCAGGACGGCGGCAGTGGAAGGTCCGTTTGTTACGGCAGACA 1767 

Qy 1319 T GGAAGACT GT G G CT ACAACATT C C AC AGACAGAT GAGT CAAC CCT CAT GACC AT AGC CT 137 8 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I || I I I I I I I I I I I I I 
Db 17 68 T GGAAGACT GTGGCTACAACATT C CC CAGACAGATGAGTCAACACTTAT GACCATAGC CT 1827 

Qy 137 9 ATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGT 14 38 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I 
Db 182 8 ATGTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAGT 18 87 

Qy 1439 GGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGC 14 98 

I I I I I I I I I I II I I I II. I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 18 8 8 GGCGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTGC 1947 

Qy 14 99 T GAAGT GAGGAG G C C CAT GG G CAGAAGAT AGAGATT C CC C T GGAC CAC AC CT C CGT GGTT 1558 

I I I I I I I I I I I I I I I I II I I I I III I I I I II I I I I I I II I I I II I I I I I I 
Db 1948 T CAAGTAAGGAGGC C C GT GGGC AGAT GAT GGAGACGC C C CT GGAC CAC AT CT GGGT GGTT 2007 

Qy 1559 CACTTTGGTCACA — AGTAGGAGACACAGATGGCACCT GT GGCCAGAGCACCT CAGGACC 1616 

I I I I I I I I I I I I III I I I 1 I Mill I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 008 CCCTTTGGTCACATGAGTTGGAGCTATGGATGGTACCTGTGGCCAGAGCACCTCAGGACC 2067 

Qy 1617 CTCCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 1676 

I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I 

Db 2 068 CT CAC CAAC CT GC CAAT GCT T CT GGC GT GACAGA- AC AGAGAAAT C AGGCAAGCT GGATT 2126 

Qy 1677 C CAGGGACT GT AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAGCACT C T GCT GGC GGGAA 1736 

I I I I I I I I I I I I I I I I I I I I I I II I I I I III III I I I I I I I I I I I I 
Db 2127 ACAG GGC TT GC AC CT GT AGGACAC AGGAGAGG GAAG GAAGCAGC GT T CT G GT G GC AGGAA 218 6 

Qy 1737 TACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGA 17 96 

II III I I I I I I I I I II III I I I I I I I I I I I I I I I M I I II I I I I I I I I I 

Db 2187 TATCCTTAGACACCACAAACTTGAGT-TGGAAATTTTGCTGCTTGAAGCTTCAGCCCTGA 22 45 

Qy 1797 ACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTC 18 56 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III III 
Db 224 6 CCCTCTGCCCAGCA-TCCTTTAGAGTCTCCAACCTAAAGTATTCTTTATGTC CTTCC 2301 

Qy 1857 AGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 1916 

I I I II I I I I I I I III II I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I II I 

Db 2302 AGAAGTACTGGCGTCATACTCAGGCTACC-CGGCATGTGTCCCTGTGGTACCCTGGCAGA 2360 

Qy 1917 GAAGAGACCAAGCTT GTTT CC CT GCTGGCCAAAGT CAGTAGGAGAGGAT GCACAGTTT GC 197 6 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 2361 GAAAGGGC CAAT C - T CAT T C C CT GC T GGC CAAAGT C AG C AGAAGA — AAGTGAAGTTTGC 2417 

Qy 1977 TAT T T GCT TT AGAGAC AGGGACT GT AT AAACAAGC CT AAC ATT GGT G CAAAGATT GC CT C 2036 

I I I I M I I I I II I I I I I I I I I I I I I I I II III MM I II I I I III II 
Db 2 418 CAGTT GCTTTAGT GAT AGGGACT GCAGACT CAAGCCT - ACACTGGTACAAAGACT GC GTC 2476 

Qy 2037 T T G AAT T AAAAAAAAA 2052 

II I I I I I I I I I I 

Db 2477 T T G AG AT AAAC AAG AA 2492 
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1. .3880 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/ db_x r e f = " FANTOM_DB : A7 3 0 0 5 9 K0 8 " 

/db_xref="taxon: 10090" 

/clone="A730059K08" 

/ tissue_type="cerebellum" 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/dev_stage="7 days neonate" 
450. .1957 

/note="beta-site APP cleaving enzyme (MGD I MGI : 1346542 , 


GB|NM_011792, evidence: BLASTN, 98%, match=3874) 
putative" 

BASE COUNT 879 a 1090 c 1031 g 880 t 
ORIGIN 

Query Match 73.1%; Score 1514.2; DB 11; Length 3880; 

Best Local Similarity 87.4%; Pred. No. 4.6e-187; 

Matches 17 97; Conservative 0; Mismatches 243; Indels 17; Gaps 12; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGG — CGCGGGAGTGCTGCCTG-CC 57 

I I I I I I I III I I I I I I I I I I I II I I I III MM M I I I I I M I I I II 

Db 450 ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGGCTCGGGAAATGCTGCCTGCCC 509 

Qy 58 CACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTG 117 

II II Mill I I II I M I I I I M I I I I I I I I I I I I I II II I i I I I II I I I 

Db 510 CAGGGAACCCATCTCGGCATCCGGCTGCCCTT-CGCAGCGGCCTGGCAGGGCCACCCCTG 568 

Qy 118 GGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGC 177 

II Ml II II II I I II I I I I II II M I I II I II I I II II I I I II I I I I I I I II 
Db 569 GGCCTGAGGCTGCCCCGGGAGACCGACGAGGAATCGGAGGAGCCTGGCCGGAGAGGCAGC 62 8 

Qy 17 8 TTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATG 237 

I I II I I I II II II I II I I I M I II II I II II I I I II II II I II I I II I II I II I I I 
Db 62 9 TT T GT GGAGAT GGT G GACAAC CT GAGGGGAAAGT C C GG C CAGGGCT AC TAT GT GGAGAT G 688 

Qy 238 ACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTT 2 97 

I I I I I II II II I I II I I II I I I II II I I II I II I I I I I I II Mill I II I I I II I 
Db 68 9 ACCGTAGGCAGCCCCCCACAGACGCTCAACATCCTGGTGGACACGGGCAGTAGTAACTTT 74 8 

Qy 2 98 GCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGC 357 

I I II I I II II I I I II I I II I I II II I I I II II I I I II II II II I I II I I I I I I I I I I 
Db 74 9 GCAGTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGC 8 08 

Qy 358 ACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGG 417 

I I II I II I II I II II II I I I I I II I I I I I I II I I I I II II I I I II I I I II I I I III 

Db 8 09 ACATATCGAGACCTCCGAAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAGGGG 8 68 

Qy 418 GAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAAC 4 77 

II I I I II II II I I II II I I I I I I I I II I II II I I I II I I I II I I II I I I II I I I I I I 

Db 869 GAACTGGGCACCGACCTGGTGAGCATCCCTCATGGCCCCAACGTCACTGTGCGTGCCAAC 928 

Qy 478 ATT GC T GC C AT CACT GAAT CAGACAAGT T CT T CAT CAAC GGCT C CAACT GGGAAGGC AT C 537 

M I II II I II II I I I I II I I II I II II I II II II I I I II II I I I I I II I I II I I II 
Db 92 9 ATT GCT GCC AT CACT GAAT C GGACAAGT T CTT CAT CAAT GGT T C CAACT GGGAGGGCAT C 98 8 

Qy 538 CTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 5 97 

II I I I I II I M I I I II II II I I I I I I I I II I I II II I I I I I I I II I I I I II I I I I 
Db 98 9 CTAGGGCTGGCCTATGCTGAGATTGCCAGGCCCGACGACTCTTTGGAGCCCTTCTTTGAC 104 8 

Qy 598 TCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 657 

II I I I I I I I I I I I II II II I I I II II I II II I II I I II I II II I I I II I II I 
Db 104 9 TCCCTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGC 1108 


Qy 

Db 


658 
1109 


717 
1168 


Qy 718 ATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTAT 777 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 1169 ATC GAC C AC T C GCT AT AC AC G GG CAGT CT CT G GT ACAC AC C CAT C C G G C GGGAGT G GTAT 1228 

Qy 77 8 TAT GAGGTCAT CATT GTGCGGGTGGAGATCAATGGACAGGAT CT GAAAAT GGACT GCAAG 837 

I I I I I II II I I I I I I II I I I II I I I I I I I I II I I I I I II I I I I I I I I I I I I 

Db 1229 TAT GAAGT GAT C ATT GT AC GT GT GGAAAT CAAT GGT CAAGAT CT CAAGAT GGACT GCAAG 1288 

Qy 838 GAGT ACAACT AT GACAAGAGC AT T GT GGAC AGT GGCAC CAC CAAC CT T C GT TT GC C CAAG 8 97 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I 

Db 12 8 9 GAGT ACAACT AC GACAAGAGC AT T GT GGAC AGTGGGAC CAC CAACCT T CGCTTGCC CAAG 1348 

Qy 8 98 AAAGT GT T T GAAGCT GC AGTCAAAT C CAT CAAGGCAG C CT C CT C C AC GGAGAAGT T C C CT 957 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 134 9 AAAGT ATTTGAAGCTGCCGTCAAGTCCATCAAGGCAGCCTCCTCGACGGAGAAGTTCCCG 1408 

Qy 958 GATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAAC 1017 

I I I I I II I I II I I II I I I I I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 14 09 GATGGCTTTTGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAAC 14 68 

Qy 1018 AT T T T C CC AGT CAT CT C ACT C T AC CTAAT G GGT GAGGTTAC CAAC CAGT C CT T C C G CAT C 1077 

I I I I II I I I I I I II I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 14 69 AT T TT C CC AGT CAT T T C ACT T T AC CT CAT GGGT GAAGT CAC CAAT CAGT C CT T C C GCAT C 1528 

Qy 107 8 AC CAT C CT T C C GCAGCAAT AC C T GC GGC C AGTGGAAGAT GT G GC C AC GT C CCAAGAC GAC 1137 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I II II I I I I I I I I I I I I 

Db 1529 AC CAT C CT T C CT CAGCAAT AC C T AC GGC C GGT GGAGGAC GT GGC CAC GT C C CAAGAC GAC 1588 

Qy 1138 T GT T ACAAGT T T GC CAT CT CAC AGT CAT C C ACGGGC ACT GT TAT GGGAGCTGTT AT CAT G 1197 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 158 9 TGTTACAAGTTCGCTGTCTCACAGTCATCCACGGGCACTGTTATGGGAGCCGTCATCATG 1648 

Qy 1198 GAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 1257 

II II I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1649 GAAGGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCT 1708 

Qy 1258 TGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGAC 1317 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I II III 

Db 17 09 TGC C AT GT GCAC GAT GAGTT C AG GACGGC GGCAGT GGAAGGT CC GT T T GT T AC G GCAGAC 1768 

Qy 1318 AT G GAAGACT GT GGCT ACAAC AT T C C AC AGACAGAT GAGT CAAC C CT CAT GAC C AT AGC C 1377 

I I I I I I I I I I I I I I I M I I I II I I I I I I I I II I II I I I I I I I I II I I I I I I I I I I I I 

Db 1769 ATGGAAGACTGT GGCTACAACATT C CCCAGACAGAT GAGT CAACACTTAT GACCATAGCC 1828 

Qy 1378 TATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 1437 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I M I II 

Db 182 9 TATGTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAG 1888 

Qy 1438 TGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 1497 

I I I I I I I I II I II I I I I I I II I I I I II I I I I I I I II I M I I I I I I I I I I I I I I I I I 

Db 18 89 TGGCGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTG 1948 

Qy 14 98 CT GAAGTGAGGAGGC C C AT GGGCAGAAGATAGAGAT T CC C CT GGAC C ACACCT CC GT GGT 1557 

II I I I I I II I I I I I I I I I I I I I I III I I II I I I I I I I I I I I I I II I I I I I 

Db 194 9 CTCAAGTAAGGAGGCCCGTGGGCAGATGATGGAGACGCCCCTGGACCACATCTGGGTGGT 2 008 

Qy 1558 T C ACT T T G GT CACA — AGTAGGAGAC AC AGAT GGCAC CT GT G GCC AGAG C AC CT C AGGAC 1615 


II 1 1 1 1 1 1 1 1 1 1 1 III II 1 1 I 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 2 009 TCCCTTTGGT C AC AT GAGTT GGAGCTAT G GAT GGT AC CT GT GGCCAGAGCACCT CAGGAC 2 068 

Qy 1616 CCTCCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 1675 

I I II I I I I I I I I I I I I I I I I I I I III I I I I I I I I I II I I I I 

Db 2 069 CCT CACCAACCTGC CAATGCTT CT GGCGT GACAGA- ACAGAGAAATCAGGCAAGCT GGAT 2127 

Qy 1676 T CCAGGGACTGTACCT GTAGGAAACAGAAAAGAGAAGAAAGAAGCACT CTGCT GGCGGGA 1735 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 212 8 TACAGGGCTTGCACCTGTAGGACACAGGAGAGGGAAGGAAGCAGCGTTCTGGTGGCAGGA 2187 

Qy 1736 ATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTG 17 95 

Ml III I MM MM II III I I I II I I II I I II I I II I I M II I I II I I 

Db 218 8 ATATCCTTAGACACCACAAACTTGAGT-TGGAAATTTTGCTGCTTGAAGCTTCAGCCCTG 224 6 

Qy 1796 AACCTTTGTCCACCATTCCTTTAAATTCTCCAACCC7WVGTATTCTTCTTTTCTTAGTTT 1855 

I I I I II I I I I I I I I I I I I I I II II II II II I II II II I I III II 
Db 2247 AC C CT C T GC CC AGCA- T C CT T T AGAGT CT C CAACCTAAAGT ATT CT T TAT GT C CTTC 2302 

Qy 1856 CAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAG 1915 

I I II I I M II I I I III II I I I I I I I I III I II I I II I II I I I II I I II I I II I 

Db 2303 CAGAAGTACTGGCGTCATACTCAGGCTACC-CGGCATGTGTCCCTGTGGTACCCTGGCAG 2361 

Qy 1916 AGAAGAGACCAAG CT T GTTTCCCTGCT GGC CAAAGT C AG TAG GAGAGGAT GCACAGT TT G 1975 

II II I I I I I I I II I I I II I II I II I I II I I I I II II I II I I II I I 

Db 2 362 AGAAAGGGC CAAT C - T CAT T C C CT GCT GGC CAAAGT C AGCAGAAGA — AAGT GAAGT T T G 2418 

Qy 197 6 CT AT TT GCT TT AGAGAC AGGGACT GT ATAAACAAGC CT AACATT GGT G CAAAGAT T GC CT 2 035 

I I II I II II II II II II I I II I I I I II I I I III I II I II II I I III I 

Db 2419 CCAGTTGCTTTAGT GAT AGGGACT GCAGACT CAAGCCT - ACACT GGTACAAAGACT GCGT 2477 

Qy 2036 CTTGAATTAAAAAAAAA 2052 

II II I I I I I I I I I 

Db 2 47 8 CT T GAGAT AAAC AAGAA 24 94 
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FEATURES Location/Qualifiers 
source 1. .4048 

/organism="Mus musculus" 

/ mo l_jt yp e - "mRNA " 

/strain="C57BL/6J n 

/db_xref="FANTOM_DB:C230037E16" 

/db_xref="taxon: 10090" 

/clone="C230037E16" 

/tissue_type- M cerebellum" 

/clone__lib="RIKEN full-length enriched mouse cDNA library" 
/dev_stage="0 day neonate" 
CDS 451. .1854 

/note= "unnamed protein product; beta-site APP cleaving 

enzyme (MGD | MGI : 134 6542 , GB | NM_011792 , evidence: BLASTN, 

98%, match=3874) 

putative" 

/codon_start-l 

/protein_id="BAC38462 . 1" 

/db_xref="GI:26349645" 

/ trans la tion= "MAP ALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLAGPPLGLRLP 
RETDEESEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVG 
AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANI 
AAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGA 
GFPLNQTEALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKM 
DCKETEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLR 
P VE DVAT S Q DDC YK FAVS Q S S T GT VMGAVTMEG F YVVFDRARKRI G FAVS ACHVHDE F 
RTAAVEGPFVTADMEDCGYNIPQTDESTLMTIAYVKAAICALFMLPLCLMVCQWRCLR 
CLRHQHDDFADDISLLK" 

BASE COUNT 952 a 1103 c 1045 g 948 t 

ORIGIN 


Query Match 65.1%; Score 1347.2; DB 11; Length 4048; 

Best Local Similarity 82.6%; Pred. No. 1.7e-165; 

Matches 1697; Conservative 0; Mismatches 243; Indels 114; Gaps 9; 


Qy 


1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 


1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 III 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 4 51 ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGCTCGGGAATGCTGCCTGCCCAG 510 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

II Mill I I I I I I I I I I Ml I I I II II II I I I I I I I I I || | I M I I I II 
Db 511 GGAACCCATCTCGGCATCCGACTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 57 0 

Qy 121 CTGCGGCTGCCCCGG GAGAC C GAC GAAGAGC C C GAGGAGC C C GG C C GGAGGG GC AGCTT T 18 0 

II I I II I I I II I I I I II I I I II I I I M I I I I II I I I I I I I I I I I I I M I II I I 
Db 571 CTGAGGCTGCCCCGGGAGACCGACGAGGAATCGGAGGAGCCTGGCCGGAGAGGCAGCTTT 630 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

I I I I I I I I II II M I II II II M I I I Mill II I I I II I II I I I I I I I II I I M I I 

Db 631 GT G GAGAT GGT G GACAAC C T GAGGGGAAAGT C CGGC CAG GGCT ACT AT GTG GAGAT GAC C 690 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

II I I I I I I I I I M I I I I I I II I I II M I II I I I I II II Mill I I II I I I I I I I I 

Db 691 GT AGGCAGCC C C C C ACAGAC G CT CAAC AT C CT GGT GGACAC GGGCAGT AGTAACT T T GCA 750 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I II II M I I II I II I II I I I I I II I I I I I II II I I I I II I I I I I I II I II I 

Db 751 GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 810 

Qy 361 T AC C GGGACCT C C GGAAGGGT GT GTAT GT G C C CT AC AC C C AGG GCAAGT GG GAAGG GGAG 42 0 

II II I II II I II II I II II II I II I II I II I I II I II I I I I I I I I I I I I I I II I I 

Db 811 TATCGAGACCTCCGAAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAGGGGGAA 870 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

II I I II II II II I II II II I II I I I I I II M I I I M I I I I I I I II I I II I I I I II II I 
Db 871 CTGGGCACCGACCTGGTGAGCATCCCTCATGGCCCCAACGTCACTGTGCGTGCCAACATT 930 

Qy 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 540 

II I I I II I II I I II I I I I I II I I I II II II I I II II I I II II I I I I I I I II I I II 
Db 931 GCTGCCATCACTGAATCGGACAAGTTCTTCATCAATGGTTCCAACTGGGAGGGCATCCTA 990 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I II I II II II II II I I II II I II I I I II I I I I I I II I II I I I M I I II I I I 
Db 991 GGGCTGGCCTATGCTGAGATTGCCAGGCCCGACGACTCTTTGGAGCCCTTCTTTGACTCC 1050 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Mill I II II I I I II I I I I I II I II II I I I I I M I I I II I I I I I II I I II II I 
Db 1051 CTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTC 1110 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 720 

I M I I II I I II I I II I II II II I I II I I I I II I II I I M I II I II I II I I II 
Db 1111 CCCCTCAACCAGACCGAGGCACTGGCCTCGGTGGGAGGGAGCATGATCATTGGTGGTATC 1170 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 80 

I I I I II I I II I II II I I I II II II I I I I I I I I I I II I I I I I I I I I I II I M II II I I 
Db 1171 GACCACTCGCTATACACGGGCAGTCTCTGGTACACACCCATCCGGCGGGAGTGGTATTAT 1230 

Qy 7 81 GAGGT CAT CAT T GT GC GGGT G GAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAGGAG 840 

M II II I I I I I I II II I II I II I II II M I II I I II II II II I I I I II I II 
Db 1231 GAAGT GAT CAT T GT AC GT GT GGAAAT CAAT GGT CAAGAT CT CAAGAT G GACT GCAAGGAG 1290 


Qy 


841 T ACAACT AT GACAAGAGC AT T GT GGACAGT GGC AC C AC CAAC CT T C GTTT GC C CAAGAAA 900 


Db 


1291 


1290 


Qy 901 GT GTT T GAAG CT G C AGT CAAAT C CAT CAAGGC AGC CT C CT C CAC GGAGAAGTT C C CT GAT 960 

I I I I I I I I I I I I I I III 

Db 12 91 AC G GAGAAGT T C C C G GAT 1308 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

II II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1309 GGCTTTTGGCTAGGGGAACAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAACATT 1368 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

I I I I I I I I I I I I I I I I I I I I I MINIM || I I I | | I I I I I I I I I I I I I I I I I I 
Db 1369 TTCCCAGTCATTTCACTTTACCTCATGGGTGAAGTCACCAATCAGTCCTTCCGCATCACC 1428 

Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I 
Db 142 9 AT C CT T CCT CAGCAAT AC C T AC G GC C GGT GGAGGAC GT GGC CAC GT C C CAAGAC GACT GT 14 88 

Qy 1141 TACAAGTTT GC CAT CT CAC AGT CAT CCACGGGCACT GTTATGGGAGCTGTTAT CATGGAG 1200 

I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 14 89 T ACAAGTT C GCT GT CT CACAGT C AT CC AC GG GC ACT GT T AT GGGAGC C GT CAT CAT G GAA 1548 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M 

Db 154 9 GGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTTGC 1608 

Qy 12 61 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

I I I I I I I I I I I I II I I I I I I I I I I I I II I I II I I I I II I I I I I II I I I I I I 

Db 1609 CAT GT GC AC GAT GAGT T CAGGAC GGCG G C AGTGGAAGGT C C GTT T GT T AC GGC AGAC AT G 1668 

Qy 1321 GAAGACT GT GG CT ACAACATT C CAC AGAC AGAT GAGT CAAC CCT CAT GAC CAT AGC CT AT 1380 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I 
Db 1669 GAAGACT GT GGCT ACAAC AT T C C C CAGACAGAT GAGT CAAC ACT TAT GAC CAT AG C CT AT 1728 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I M 
Db 172 9 GTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAGTGG 1788 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I II I I I I I I I I I I I Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 178 9 CGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTGCTC 184 8 

Qy 1501 AAGTGAGGAGGCCCAT GGGCAGAAGAT AGAGATT CCCCT GGACCACACCTCC GT GGTT CA 1560 

I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I II I I I I II I 
Db 1849 7\AGTAAGGAGGCCCGTGGGCAGATGATGGAGACGCCCCTGGACCACATCTGGGTGGTTCC 1908 

Qy 1561 CTTTGGTCACA — AGTAGGAGACACAGATGGCAC CT GTGGCCAGAGCACCT CAGGACCCT 1618 

I I I I I I I I I M III I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 1909 C T T T G GT CAC AT GAGT T GGAGCT AT G GAT GGT AC CT GT GGC CAGAGC AC CT C AGGACC CT 1968 

Qy 1619 CCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCC 1678 

I I I I I I I I I I I I I II I I I I I III I I I I I I II I I I I I I I I I 
Db 1969 CAC CAAC CT GC CAAT GCTTCTGGCGT GACAGA- ACAGAGAAAT C AG GCAAGC T GGAT T AC 2027 


Qy 

Db 


1679 
2028 


1738 
2087 


Qy 1739 CTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 1798 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I II 
Db 2088 T C CT T AGACAC C ACAAACTT GAGT - T GGAAAT T TT G CT GCTT GAAGCTT CAGC C CCT GAC 214 6 

Qy 1799 CTTTGTCCACCATTCCTTTAAATTCTCCAACCC7\AAGTATTCTTCTTTTCTTAGTTTCAG 1858 

II II I I I II I I I I I I I I I I II I I I I I I I I I I I III I I I II 

Db 2147 CCTCTGCCCAGCATCCTTTAGAGTCTCCAACCTAAAGTATTCTTTATGTC CTTCCAG 2203 

Qy 1859 AAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGA 1918 

I I I I I I I I I I III II I I I I MM Ml I II I II I I M I I I II I I I I I I II II II 
Db 22 04 AAGTACTGGCGTCATACTCAGGCTACC-CGGCATGTGTCCCTGTGGTACCCTGGCAGAGA 2262 

Qy 1919 AGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTA 197 8 

I I I I I I I I I I I I I II I I I I II I II I I I II I I I II II II I II I I I 
Db 2263 AAGG GC C AAT C - T CAT T C CCT G CT GGC CAAAGT CAGC AGAAGA — AAGTGAAGTTTGCCA 2319 

Qy 1979 TTT GCTTTAGAGACAGGGACT GTATAAACAAGCCTAACATTGGT GCAAAGATT GCCT CTT 203 8 

I I I I I II I I II I I II II I I I I I I I I II I III II I I M II I I II I II I I 
Db 232 0 GTTGCTTTAGTGATAGGGACTGCAGACTCAAGCCT-ACACTGGTACAAAGACTGCGTCTT 237 8 

Qy 2039 GAATTAAAAAAAAA 2052 

M I I II I I II 
Db 2379 GAGATAAACAAGAA 2392 


RESULT 9 
AK079814 
LOCUS 

DEFINITION 


ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 


REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 


Craniata; Vertebrata; Euteleos tomi ; 
Sciurognathi ; Muridae; Murinae; Mus . 


AK079814 3147 bp mRNA linear HTC 05-DEC-2002 

Mus musculus 0 day neonate thymus cDNA, RIKEN full-length enriched 
library, clone : A430076B07 product : beta-site APP cleaving enzyme, 
full insert sequence. 
AK079814 

AK07 9814 . 1 GI: 2 609877 3 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 
1 

Carninci,P. and Hayashizaki, Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 
Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 
Normalization and subtraction of cap-trapper-selected cDNAs to 
prepare full-length cDNA libraries for rapid discovery of new genes 


Genome Res 
20499374 
11042159 
3 

Shibata, K. 
Konno, H . , 
Sumi, N. , 


10 (10), 1617-1630 (2000) 


, Itoh,M., 
Akiyama, J. , 


Aizawa, K. , 
Nishi, K. , 


Ishii,Y., Nakamura, S . , 


Nagaoka,S., Sasaki,N., Carninci,P. 
Kitsunai,T., Tashiro,H., Itoh,M., 
Hazama,M., Nishine,T., Harada,A. , 


Yamamoto, R. , Matsumoto, H . , Sakaguchi , S . , Ikegami,T., Kashiwagi , K. , 
Fujiwake, S . , Inoue,K. , Togawa,Y., Izawa,M. , Ohara,E., Watahiki,M. , 
Yoneda,Y., Ishikawa , T . , Ozawa,K., Tanaka,T., Matsuura,S. , Kawai,J., 
Okazaki,Y., Muramatsu, M. , Inoue,Y., Kira,A. and Hayashizaki , Y. 

TITLE RIKEN integrated sequence analysis (RISA) system — 384-format 

sequencing pipeline with 384 multicapillary sequencer 

JOURNAL Genome Res. 10 (11), 1757-1771 (2000) 

MEDLINE 20530913 
PUBMED 11076861 
REFERENCE 4 

AUTHORS Kawai,J., Shinagawa, A. , Shibata,K., Yoshino,M., Itoh,M., Ishii,Y., 
Arakawa,T., Hara,A., Fukunishi, Y . , Konno,H. , Adachi / J. / Fukuda,S., 
Aizawa,K., Izawa,M., Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka,I., 
Saito,T., Okazaki,Y., Gojobori,T., Bono,H. f Kasukawa,T., Saito,R. , 
Kadota,K., Matsuda,H., Ashburner,M. , Batalov, S., Casavant,T., 
Fleischmann, W. , Gaasterland, T . , Gissi,C, King,B., Kochiwa,H., 
Kuehl,P., Lewises., Matsuo,Y. f Nikaido,I., Pesole,G., 
Quackenbush, J. , Schriml, L.M. , Staubli,F., Suzuki, R., Tomita,M., 
Wagner, L., Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H., 
Baldarelli , R. , Barsh,G., Blake, J., Boffelli,D., Bojunga,N., 
Carninci,P., de Bonaldo,M. F. , Brownstein,M. J. , Bult,C, 
Fletcher, C, Fujita,M., Gariboldi , M. , Gustincich, S . , Hill,D., 
Hofmann,M., Hume, D. A., Kamiya,M. , Lee, N . H . , Lyons, P., 
Marchionni, L. , Mashima,J., Mazzarelli , J. , Mombaerts , P . , Nordone,P., 
Ring,B., Ringwald,M., Rodriguez, I . , Sakamoto, N . , Sasaki, H., 
Sato,K., Schonbach,C. , Seya,T., Shibata,Y., Storch,K.F., Suzuki, H . , 
Toyo-oka,K., Wang,K.H., Weitz,C, Whittaker , C. , Wilming,L., 
Wynshaw-Boris, A. , Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
and Hayashizaki, Y. 

TITLE Functional annotation of a full-length mouse cDNA collection 

JOURNAL Nature 409 (6821), 685-690 (2001) 

MEDLINE 21085660 
PUBMED 11217851 
REFERENCE 5 

AUTHORS The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

TITLE Analysis of the mouse transcriptome based on functional annotation 

of 60,770 full-length cDNAs 

JOURNAL Nature 420, 563-573 (2002) 
REFERENCE 6 (bases 1 to 3147) 

AUTHORS Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P., 
Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume, W. , 
Hayashida, K. , Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T. , 
Hori,F., Imotani,K., Ishii,Y., Itoh,M. , Kagawa,I., Kasukawa,T., 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M., 
Koya,S., Kurihara,C, Matsuyama, T . , Miyazaki,A., Murata,M. , 
Nakamura,M. r Nishi,K., Nomura, K. , Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh, H., Sakai,C, Sakai,K., Sakazume,N., 
Sano,H., Sasaki, D. , Shibata,K., Shinagawa , A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A. , Takahashi, F. , Takaku-Akahira, S . , 
Takeda,Y., Tanaka,T., Tomaru,A. , Toya,T., Yasunishi, A. , 
Muramatsu,M. and Hayashizaki, Y. 

TITLE Direct Submission 

JOURNAL Submitted ( 1 6-APR-2002 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 


COMMENT 


FEATURES 

source 


misc feature 


BASE COUNT 
ORIGIN 


Kanagawa 230-0045, Japan ( E-mail : genome-res @gsc . riken . go . jp, 
URL :http: //genome. gsc. riken.go.jp/, Tel : 81-4 5-503-9222 , 
Fax:81-45-503-9216) 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : / /genome . gsc . riken . go . jp/ 
URL:http: // fantom.gsc. riken. go. jp/ . 

Location/Qualif iers 

1. .3147 

/organism="Mus mus cuius" 

/mol_type= M mRNA" 

/strain="C57BL/6J" 

/db_xref="FANTOM_DB:A430076B07" 

/db_xref="taxon: 10090" 

/clone="A4 3007 6B07" 

/ t is sue_type=" thymus" 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/dev_stage="0 day neonate" 
1. .3147 

/note="beta-site APP cleaving enzyme (MGD I MGI : 1346542, 
GB|NM_011792, evidence: BLASTN, 98%, match=3874 ) " 
738 a 840 c 791 g 778 t 


Query Match 51.6%; Score 1068.4; DB 11; 

Best Local Similarity 86.3%; Pred. No. 2.1e-129; 
Matches 1286; Conservative 0; Mismatches 191; Indels 


Length 3147; 

13; Gaps 


9; 


Qy 


Db 


565 AGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCC 624 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
2 82 AGGCCCGACGACTCTTTGGAGCCCTTCTTTGACTCCCTGGTGAAGCAGACCCACATTCCC 341 


Qy 

Db 

Qy 

Db 

Qy 

Db 


625 AACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTG 684 

Ml I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I II I III 
342 AACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTCCCCCTCAACCAGACCGAGGCACTG 4 01 

685 GC C T CT GT C GGAGG GAGC AT GAT C ATT GGAGGTAT CGAC CACT C G CT GT ACACAGGC AGT 744 

I M I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I 
4 02 GCCTCGGTGGGAGGGAGCATGATCATTGGTGGTATCGACCACTCGCTATACACGGGCAGT 4 61 

745 CTCTGGTATACACCCATCCGGCGGGAGTGGTATTATGAGGTCATCATTGTGCGGGTGGAG 8 04 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II II I I I I I I II I I I I I 
4 62 CT C T GGT AC ACAC C CAT CC GGC GGGAGT GGT ATT AT GAAGT GAT CATT GT AC GT GT GGAA 521 


Qy 

Db 

Qy 

Db 


8 05 AT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAGT ACAACT AT GACAAGAGC AT T GT G 8 64 

M I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

522 AT CAAT GGT CAAGAT CT CAAGAT GGACT G CAAGGAGT ACAACT AC GACAAGAGC AT T GT G 581 

8 65 GAC AGT GG C AC C AC CAACCT T C GT T T GC CCAAGAAAGT GT TTGAAGCT G CAGT CAAAT CC 924 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill III 

582 GACAGTGGGACCACCAACCTTCGCTTGCCCAAGAAAGTATTTGAAGCTGCCGTCAAGTCC 641 


Qy 


925 AT CAAG G CAGC CT C CT CC AC GGAGAAGT T C C CT GAT GGT T T CT GGCT AGGAGAG CAGCTG 984 


1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 642 ATCAAGGCAGCCTCCTCGACGGAGAAGTTCCCGGATGGCTTTTGGCTAGGGGAGCAGCTG 701 

Qy 985 GTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTA 1044 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7 02 GTGTGCTGGCAAGCAGGCACGACCCCTTGGAACATTTTCCCAGTCATTTCACTTTACCTC 761 

Qy 1045 ATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGG 1104 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I III 

Db 762 AT GGGT GAAGT CACCAAT CAGT C CTT CC GC AT C AC C AT CCT T C C T CAGCAAT AC CT AC GG 821 

Qy 1105 C CAGT GGAAGAT GT GGC C AC GT C C CAAGAC GACT GT T ACAAGT T T GC CAT CT CACAGT C A 1164 

II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I 

Db 822 C CGGT G GAGGAC GTGGC C AC GT C C CAAGAC GACT GT T ACAAGT T C G CT GT CT CACAGT CA 881 

Qy 1165 TCCACGGGCACTGTTATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGG 1224 

I I I I I I I II I I I I I I I I I I I I I I II II I I I I I I I I I I II I II I I I I I I I I I I 
Db 882 TCCACGGGCACTGTTATGGGAGCCGTCATCATGGAAGGTTTCTATGTCGTCTTCGATCGA 941 

Qy 122 5 GCCC GAAAAC GAATT GGCT T T GCT GT CAG C GCT T G C CAT GT G C AC GAT GAGT T C AGGAC G 1284 

I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 942 GCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACG 1001 

Qy 1285 GCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATGGAAGACTGTGGCTACAACATTCCA 1344 

II M I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 1002 GCGG C AGT GGAAGGT C C GT T T GT T AC G GC AGACAT GGAAGACT GT G GCT ACAAC AT T C C C 1061 

Qy 1345 CAGAC AGAT GAGTCAAC CCT CAT GAC C AT AGC CT AT GT CAT GGCT GC CAT CTGCGCCCTC 14 04 

I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1062 CAGAC AGAT GAGT CAACACT TAT GAC CAT AGC CT AT GT CAT GGC GGC CAT CT GCGCCCTC 1121 

Qy 14 05 TTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAG 14 64 

I I I I I I I I I I I I I I I I I II I I I I I I I II I II I I I I I I I I I I I II I I I I I I I I I I I 
Db 1122 TTCATGTTGCCACTCTGCCTCATGGTATGTCAGTGGCGCTGCCTGCGTTGCCTGCGCCAC 1181 

Qy 1465 CAGCAT GAT GACTTT GCTGAT GACAT CTCCCT GCT GAAGT GAGGAGGCCCAT GGGCAGAA 1524 

I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 1182 CAG C AC GAT GACTT T GCT GAT GACAT CTCCCTGCT CAAGTAAGGAG GCC CGT GGGC AGAT 1241 

Qy 1525 GAT AGAGAT T C C CCT GGAC CACACCT CC GT GGT T C ACT T T GGT C ACA — AGTAGGAGACA 1582 

Ml I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I III I I I I I 
Db 1242 GATGGAGACGCCCCTGGACCACATCTGGGTGGTTCCCTTTGGTCACATGAGTTGGAGCTA 1301 

Qy 1583 CAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCCCCACCCACCAAATGCCTCTGCC 1642 

IMM I I I I I II I I I I II II I I I I I I II I I I I I I III II I I I I II MM I 
Db 1302 TGGATGGTACCTGTGGCCAGAGCACCTCAGGACCCTCACCAACCTGCCAATGCTTCTGGC 1361 

Qy 164 3 TTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAGGGACTGTACCTGTAGGAAACAG 17 02 

III III I I I I I I II I II II I II II I II II II I II II I II I I II 
Db 1362 GT GACAGA- ACAGAGAAAT CAGGCAAGCT GGATTAGAGGGCTT GCACCT GTAGGACACAG 1420 

Qy 1703 AAAAGAGAAGAAAGAAG C ACT CT GCT GGC G GGAAT ACT CTT GGT CACCT CAAAT TTAAGT 17 62 

I II II I I II I I I I I II I I II I II I I I I I I I I. I I I I I I I I II II I 
Db 1421 GAGAGGGAAGGAAGCAGCGTT CTGGT GGCAGGAATATCCTTAGACACCACAAACTT GAGT 1480 

Qy 17 63 CGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCTTTGTCCACCATTCCTTTAAATT 1822 

II M I I I I II I I I II I I I II M I I I I I I II III II III II I II II II I I 


Db 


14 81 -TGGAAATTTTGCTGCTTGAAGCTTCAGCCCTGACCCTCTGCCCAGCA-TCCTTTAGAGT 1538 


Qy 1823 CTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTT 1882 

I M I I I I I I I I I I I I I I I I III I I I I M I | | | | | | | | | | | | | | | | | | 

Db 1539 CTCCAACCTAAAGTATTCTTTATGTC CTTCCAGAAGTACTGGCGTCATACTCAGGCT 1595 

Qy 18 83 ACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCT 1942 

III III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | I I I I | I I I I 
Db 1596 ACC-CGGCATGTGTCCCTGTGGTACCCTGGCAGAGAAAGGGCCAATC-TCATTCCCTGCT 1653 

Qy 1943 GGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATTTGCTTTAGAGACAGGGACTGTA 2002 

I I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I I I I I M I 
Db 1654 GGCCAAAGT CAGCAGAAGA — AAGTGAAGTTTGCCAGTTGCTTTAGTGATAGGGACTGCA 1711 

Qy 2 003 T AAACAAGC C TAACAT T GGT GCAAAGAT TGCCTCTT GAATTAAAAAAAAA 2 052 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1712 GACTCAAGCCT-ACACTGGTACAAAGACTGCGTCTTGAGATAAACAAGAA 17 60 
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AF150387 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 


REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
COMMENT 


FEATURES 
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BASE COUNT 
ORIGIN 


AF150387 1114 bp mRNA linear EST 22-JUN-1999 

AF150387 Human mRNA from cd34+ stem cells Homo sapiens cDNA clone 
CBMAPH03, mRNA sequence. 
AF150387 

AF15 038 7. 1 GI: 5133823 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1114) 

Ye, M. , Zhang, Q.H., Zhou, J., Shen,Y., Wu,X.Y., Guan,Z.Q., Wang,L., 
Fan, H . Y . , Mao, Y . F . , Dai,M., Huang, Q.H., Chen, S.J. and Chen, Z. 
Human mRNA from cd34+ stem cells 
Unpublished 
Contact: Ye M 

Shanghai Institute of Hematology 

Shanghai Second Medical University, Rui-Jin Hospital 
197 Rui-Jin Road II, Shanghai, 200025, P. R. China 
Email: zchen@stn.sh.cn. 

Location/Qualifiers 

1. .1114 

/organism="Homo sapiens" 
/molJ:ype="mRNA" 
/db_xref="taxon:9606" 
/clone="CBMAPH03" 

/tissue_type= M umbilical cord blood" 
/cell_type="cd34+ stem cell" 

/clone_lib- "Human mRNA from cd34 + stem cells" 
275 a 300 c 286 g 253 t 


Query Match 47.9%; Score 991.6; DB 9; Length 1114; 

Best Local Similarity 97.4%; Pred. No. 3.2e-119; 

Matches 1093; Conservative 0; Mismatches 19; Indels 10; 


Gaps 


Qy 712 G GAGGT AT CGAC C ACT C GCT GT AC ACAGGC AGT CT CT GGTAT ACAC C CAT CCGGCGG GAG 771 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GGAG GT AT CGAC CAC T C GCT GT AC ACAGGCAGT CT CT GGTAT AC AC C CAT C C GGC G GGAG 60 

Qy 772 T G GT AT TAT GAGGT CAT CATT GT GC G GGT GGAGAT CAAT GGAC AGGAT CT GAAAAT GGAC 831 

i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 T GGT ATT AT GAGGT CAT CAT T GT G C GG GT GGAGAT CAAT GGACAGGAT C T GAAAAT GGAC 120 

Qy 832 TGCAAGGAGTACAACT ATGACAAGAGCATT GT GGACAGTGGCACCACCAAC CTT CGTTT G 891 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I II I I I I I II I I I I I I 
Db 121 T GCAAGGAGT ACAACTATGACAAGAGCATT GT GGACAGT GGCACCACCAACCTT CGTTTG 18 0 

Qy 892 CCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAG 951 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 181 C C CAAGAAAGT GT T T GAAGCT GC AGT CAAAT C CAT CAAGGC AGC C T CCT C CAC GGAGAAG 24 0 

Qy 952 TTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCT 1011 

I I I M I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | 
Db 241 TTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCT 300 

Qy 1012 TGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTC 1071 

I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 T GGAACAT T T T C C CAGT CAT CT CAC T CTAC CT AAT GGGT GAGGT T ACCAAC C AGT CCT T C 360 

Qy 1072 C GC AT CAC CAT C CTT CC GC AGCAAT ACCT G C G GC C AGT GGAAGAT GT GGC CAC GT CC CAA 1131 

M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 361 CGCATCACCATCCTTCCGCAGCAATACCTGCGGCCA-TGGAAGATGTGGCCACGTCCCAA 419 

Qy 1132 GACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTT 1191 

I I I I I I I II I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I i I I I I I I I I || I | | | | | 
Db 420 GACGACTGTTAC-AGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTT 47 8 

Qy 1192 ATCATGGAGGGCTTCTA-CGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGT 1250 

I I I I I M I I i I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I III 
Db 47 9 ATCATGGAGGGCTTCTACCGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTGCTTGT 538 

Qy 1251 CAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCAC 1310 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 539 C AG C GCT T GC CAT GT GC AC GAT GAGTT CAGGACGGCAGC G GT GGAAG G C C C - T TT GT CAC 597 

Qy 1311 CTTGGACAT GGAAGACT GTGGCTACAACATT CCACAGACAGAT GAGTCAACCCTCATGAC 1370 

I I I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I M I I I I 
Db 598 CTT GGACAT GGAAGAC T GT GGCT ACAACATT C CACAGACAGAT GAGT CAAC C CT C AT GAC 657 

Qy 1371 CATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGT 1430 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 658 CATAGCCTATGTCATGGCTGCCATCTGCG-CCTCTTCATGCTGCCACTCTGCCTCATGGT 716 

Qy 14 31 GTGTCAG-TGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACA 1489 

If I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 717 GT GT CAGAT GGCGCTGCCTC C GCT GC CT GC GC C AGC AGCAT GAT GAC TAT GCT GAT GACA 776 

Qy 14 90 T CT C CCT GCT GAAGT GAGGAGG C C CAT G GGC AGAAGAT AGAGATT CC C CT GGACCACAC C 1549 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 7 77 TCTCCCTGCT GAAGT GAGGAGGC C CAT G GGCAGAAGAT AGAGATT C C C CT G GACCACAC C 836 

Qy 1550 TCCGTGGTT C ACT TT G GT CACAAGT AGGAGAC AC AGAT G GC ACCT GT GGC C AGAGC AC CT 1609 


I 1 1 1 1 1 1 III I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 837 TACGTGGTCACTTTGGCAC AGT AGGAGACAC AGAT GGCAC CT GT GGC CAGAGCAC CT 893 

Qy 1610 CAGGACCCTCCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAG 1669 

I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 894 CAGGAC C C T T C C C ACC C AC CAAAT GC CT CT GCAT T GAT GGAGAAGGAAAAGGCT G GCAAG 953 

Qy 1670 GT GGGTTCCAGGGACT GTACCT GTAGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCTG 1729 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 954 GT GGGT T C C AG GGACT G - AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAGC ACT CT GCT G 1012 

Qy 1730 GCGGGAATACTCTTGGTCACCTC7\AATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCA 17 8 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I 
Db 1013 GCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGAAAATTCTGCTGCTTGAAACTTCA 1072 

Qy 1790 GCCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCC 1831 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1073 AC C CT GAAC CT TT GT CAAC CAT T C C T TTAAATT ATACAAC C C 1114 
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REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 


TITLE 


AK014390 2504 bp mRNA linear HTC 07-DEC-2002 

Mus musculus adult male brain cDNA, RIKEN full-length enriched 
library, clone : 35264 02A15 product : beta-site APP cleaving enzyme, 
full insert sequence. 
AK014390 

AK0143 90.2 GI: 2638 038 0 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 

Carninci,P. and Hayashizaki, Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 

Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 

Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N., 
Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H. 
Sumi,N., Ishii,Y., Nakamura, S . , Hazama,M., Nishine,T. 
Yamamoto,R., Matsumoto, H . , Sakaguchi , S . , Ikegami,T., Kashiwagi, K. , 
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M. , Ohara,E., Watahiki,M., 
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., 
Okazaki,Y., Muramatsu,M. , Inoue,Y., Kira,A. and Hayashizaki, Y. 
RIKEN integrated sequence analysis (RISA) system — 384-format 
sequencing pipeline with 384 multicapillary sequencer 


Carninci, P . 

Itoh,M. , 
Harada, A. , 


JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 


TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 


TITLE 
JOURNAL 


Genome Res. 10 (11), 1757-1771 (2000) 

20530913 

11076861 

4 

Kawai,J., Shinagawa, A. , Shibata,K., Yoshino,M., Itoh,M., Ishii,Y., 
Arakawa,T., Hara,A., Fukunishi, Y . , Konno,H. , Adachi,J., Fukuda,S., 
Aizawa,K., Izawa,M. , Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka, I . , 
Saito,T., Okazaki,Y., Gojobori,T., Bono,H., Kasukawa^T., Saito,R., 
Kadota,K., Matsuda,H., Ashburner,M. , Batalov, S., Casavant,T. , 
Fleischmann, W. , Gaasterland, T . , Gissi,C, King,B., Kochiwa,H., 
Kuehl,P., Lewis, S., Matsuo,Y., Nikaido,I., Pesole f G. f 
Quackenbush, J. , Schriml, L. M. , Staubli,F., Suzuki, R. , Tomita,M. , 
Wagner, L., Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H., 
Baldarelli , R. , Barsh,G., Blake, J., Boffelli,D., Bojunga,N., 
Carninci,P., de Bonaldo, M. F. , Browns tein,M. J. , Bult,C, 
Fletcher, C, Fujita,M., Gariboldi , M. , Gustincich, S . , Hill,D., 
Hofmann, M. , Hume, D. A., Kamiya,M. , Lee, N . H . , Lyons, P., 
Marchionni, L. , Mashima, J. , Mazzarelli, J. , Mombaerts , P . , Nordone,P., 
Ring,B., Ringwald,M., Rodriguez, I . , Sakamoto, N., Sasaki, H . , 
Sato,K., Schonbach, C. , Seya,T., Shibata,Y., Storch,K.F., Suzuki, H., 
Toyo-oka,K., Wang,K*H., Weitz,C, Whittaker , C . , Wilming, L. , 
Wynshaw-Boris, A. , Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
and Hayashizaki, Y. 

Functional annotation of a full-length mouse cDNA collection 

Nature 409 (6821), 685-690 (2001) 

21085660 

11217851 

5 

The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
6 (bases 1 to 2504) 
Adachi,J., Aizawa,K., Akahira,S 
Arakawa,T., Bono,H., Carninci,P 
Furuno,M., Hanagaki,T., Hara,A. 
Hiraoka,T., Hori,F., Imotani,K. , Ishii,Y 
Kasukawa,T., Kato,H., Kawai,J., Kojima,Y 


Akimura,T., Arai,A. , Aono,H., 
Fukuda,S., Fukunishi, Y. , 
Hayatsu,N., Hiramoto,K., 

Itoh,M., Izawa,M., 
Konno , H . , Kouda , M . , 


COMMENT 


Koya,S., Kurihara,C, Matsuyama, T . , Miyazaki,A., Nishi,K., 
Nomura, K. , Numazaki,R., Ohno,M., Okazaki,Y., Okido,T., Owa,C, 
Saito,H., Saito,R., Sakai,C, Sakai,K., Sano,H., Sasaki, D., 
Shibata,K., Shibata,Y., Shinagawa, A. , Shiraki,T., Sogabe,Y., 
Suzuki, H., Tagami,M., Tagawa,A. , Takahashi, F. , Tanaka,T., 
Tejima,Y., Toya,T., Yamamura,T., Yasunishi, A. , Yoshida,K., 
Yoshino,M., Muramatsu,M. and Hayashizaki , Y. 
Direct Submission 

Submitted ( 10- JUL-2000 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail :genome-res@gsc. riken. go. jp, 
URL : http : //genome. gsc.riken. go. jp/, Tel: 81-45-503-9222, 
Fax:81-45-503-9216) 

On Dec 10, 2002 this sequence version replaced gi: 12852207. 
Please visit our web site (http://genome.gsc.riken.go.jp/) for 
further details . 


cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. First strand cDNA was primed with a primer 
[5* GAG AG AG AG AAG GAT C C AA G AG CTCTTTTTTTTTTTTTTTT VN 3'], cDNA was 
prepared by using trehalose thermo-activated reverse transcriptase 
and subsequently enriched for full-length by cap-trapper. Second 
strand cDMA was prepared with the primer adapter of sequence [5 1 
GAGAGAGAGAT T C T C GAGT TAAT T AAATTAAT CCCCCCCCCCCCC 3']. cDNA was cleaved 
wi*th BamHI and Xhol . Vector: a modified pBluescript KS ( + ) after 
bulk excision^f rom Lambda FLC I. Cloning sites, 5 ? end: Sail; 3 T 
end: BamHI. Host: DH10B. 
FEATURES Location/Qualifiers 
source 1. .2504 

/organism="Mus mus cuius" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/ db_xr e f = " FANTOM_DB : 3 5 2 6 4 0 2 Al 5 " 

/ db_x r e f = "MGI : 1 8 9 6 7 2 9 " 

/db_xref="taxon: 10090" 

/clone="352 64 02Al5" 

/sex-"male" 

/ tissue_type="brain" 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/devjstage="adult" 
CDS <1. .804 

/note="unnamed protein product; beta-site APP cleaving 

enzyme (MGD | MGI : 1346542 , GB | NM_0117 92 , evidence: BLASTN, 

98%, match-3874) 

putative" 

/ codon_start=l 

/protein_id="BAB2 9317 . 2 " 

/db_xref="GI: 26380381" 

/db_xref="MGI: 134 6542" 

/ 1 r ans lat ion= "MI I GGI DHS LYTGSLWYTPI RREWYYEVI I VRVEINGQDLKMDC 
KEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTT 
PWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVM 
GAVIMEGFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQTDE 
STLMTIAYVMAAI CALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLLK" 
polyAjsignal 2483. .2488 

/not e= "putative" 
polyA_site 2504 

/ note="putative " 

BASE COUNT 616 a 637 c 631 g 620 t 

ORIGIN 

Query Match 46.4%; Score 960.8; DB 11; Length 2504; 

Best Local Similarity 86.3%; Pred. No. 1.9e-115; 

Matches 1167; Conservative 0; Mismatches 172; Indels 13; Gaps 9; 

Qy 703 AT GAT CAT T G GAGGT AT C GAC C ACT CGCT GT AC AC AGGC AGT CT CT GGT AT ACAC C CAT C 762 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGATCATTGGTGGTATCGACCACTCGCTATACACGGGCAGTCTCTGGTACACACCCATC 60 


QY 


7 63 CGGCGGGAGT GGTATTAT GAGGT CATCATT GT GCGGGTGGAGAT CAAT GGACAGGATCT G 822 
I I I I I I I I I I I I I I I II I I I II I I I I I I I I II I I I I I I I I I I I I I II I I I II 


Db 61 CGGC GGGAGT GGTATTAT GAAGT GAT CATT GTACGT GTGGAAAT CAATGGT CAAGAT CT C 12 0 

Qy 823 AAAATGGAC T GCAAGGAGT ACAACT ATGACAAGAGC AT T GT G GAC AGT GGC AC CAC CAAC 882 

II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I 

Db 121 AAGATGGACT GCAAGGAGT ACAACTACGACAAGAGCATT GTGGACAGTGGGAC CAC CAAC 18 0 

Qy 883 CTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCC 942 

I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I 
Db 181 CTTCGCTTGCCCAAGAAAGTATTTGAAGCTGCCGTCAAGTCCATCAAGGCAGCCTCCTCG 24 0 

Qy 943 ACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGC 10 02 

I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 241 ACGGAGAAGTTCCCGGATGGCTTTTGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGC 300 

Qy 1003 AC CAC C C C T T GGAAC AT T TT CC C AGT CAT CT CAC T C T AC CTAAT GGGT GAGGT TACCAAC 10 62 

II I I I II I I I I I I II I I I I I I I I I I I I I II I I I Mill I I I I I I II II I I I I I 

Db 301 AC GAC C CCT T GGAAC AT TTT C C C AGT CAT T T C ACT T T AC CT CAT GG GT GAAGT CAC CAAT 360 

Qy 1063 CAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCC 1122 

I I I I I I I II I I I II I II I I I I I I I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I 
Db 361 CAGTCCTTCCGCATCACCATCCTTCCTCAGCAATACCTACGGCCGGTGGAGGACGTGGCC 42 0 

Qy 1123 AC GT CC CAAGAC GACT GTT ACAAGTT T GC CAT C T C AC AGTC AT C CAC GG GCACT GT TAT G 1182 

I I I I I I I I I I I I 1 I I I I II i I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 AC GT C C CAAGAC GACT GTT ACAAGTT C GC T GT CT C AC AGTCAT C C AC GGG CACT GT TAT G 480 

Qy 1183 GGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGC 1242 

I I I I I I I I I I I I I I I II I I I I I II I I I I I I M I I MINIM I I M I II I I 
Db 481 GGAGCCGTCATCATGGAAGGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGC 54 0 

Qy 124 3 TTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCT 13 02 

I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I II II I I M I I I I II 
Db 541 TTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCGGCAGTGGAAGGTCCG 600 

Qy 1303 TTT GT CAC CTT GGAC AT GGAAGAC T GT GGCT ACAAC AT T C C ACAGACAGAT GAGT CAAC C 1362 

I I I I I II I I I I I I I I I II I I I I I I I I I I I I [ I I I I | t I I I I I I I I I I I I I I I I 

Db 601 TTT GTT AC GGCAGAC AT GGAAGACT GT GGCT ACAAC AT T C C C C AGAC AGAT GAGT CAAC A 660 

Qy 1363 CTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGC 1422 

M I I II I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II | | | || | | | | M 
Db 661 CTTATGACCATAGCCTATGTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGC 720 

Qy 1423 CTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCT 1482 

I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I M I I I I I I I I I I I II I I I I I 
Db 721 CTCATGGTATGTCAGTGGCGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCT 7 80 

Qy 14 83 GATGACAT CTCC CT GCTGAAGT GAGGAGGCCCAT GGGCAGAAGATAGAGATTCC CCT GGA 1542 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I III I I If I I I 1 I I I I 
Db 781 GAT GAC AT CT C C CT GCT CAAGT AAGGAGG C CC GT GGG CAGAT GATGGAGAC GC C CCT GGA 840 

Qy 1543 CCACACCTCCGTGGTTCACTTTGGTCACA — AGTAGGAGACACAGAT GGCACCT GT GGCC 1600 

I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I II I M I I I I I I I I I I 
Db 841 CCACATCTGGGTGGTTCCCTTTGGTCACATGAGTTGGAGCTATGGATGGTACCTGTGGCC 900 

Qy 1601 AGAGC AC CT CAGGAC C CT C C C CAC C CAC CAAAT GC CT CT GC CT T GAT GGAGAAGGAAAAG 1660 

I I I I I I I I I I I I I II I I I I II Ill MM 

Db 901 AGAG CAC CT C AG GAC C CT CAC CAAC C T GC CAAT G CT T CT GGC GT GAC AGA- ACAGAGAAA 959 


Qy 1661 GCTGGCAAGGTGGGTTCCAGGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGC 1720 

I I I I I I I III II I I I I I M I I I I I I I I I I I I I I I M | | | | Ml III 
Db 960 T CAGGCAAGCT GGATTACAGGGCTT GCACCTGT AGGACACAGGAGAGGGAAGGAAGCAGC 1019 

Qy 1721 ACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTT 178 0 

M I I I I I I I I I I I I I I I I I II | | I | | | | | | | | | M I I I I I I I I I I I 
Db 1020 GT T C T G GT GGC AGGAAT AT C CT T AGAC ACCACAAAC T T GAGT - T GGAAAT TTTGCTGCTT 107 8 

Qy 1781 GAAACT T CAGC C CT GAAC CT T T GT C C AC CAT T C CT T TAAATT CT CCAAC C CAAAGT ATT C 184 0 

I I I I I I I I I I I I I I I Ml || I I I II I I I I I I I I I I I I I I M I I I I I II I I I 
Db 1079 GAAGCTTCAGCCCTGACCCTCTGCCCAGCA-TCCTTTAGAGTCTCCAACCTAAAGTATTC 1137 

Qy 1841 TTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCT 1900 

M IN I I I I I I I I I I I I I I I I M I I I I I | M I I I I I I I I I I I I I I 

Db 1138 TTTATGTC C T T C C AGAAGT ACT GGC GT CAT ACT C AGGC T AC C - C GG C AT GT GT C C C T 1193 

Qy 1901 GT GGT AC C C T GGCAGAGAAGAGAC CAAGC T T GT T T C C CT GCTGGC CAAAGT C AGT AGGAG 1960 

I M I I I I I I I I I I I I I I I I I I I I I I | I I II I I I I I I I I I II I I I I I I II II 
Db 1194 GT GGT AC C CT GGCAGAGAAAGG G C CAAT C - T CAT T C C CT GCT GG C CAAAGT C AG CAGAAG 1252 

Qy 1961 AG GAT G CACAGT T T GCT AT T T GCT T TAGAGAC AGGGACT GTAT AAACAAGC CTAACAT T G 2 02 0 

I II I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I 

Db 1253 A — AAGTGAAGTTTGCCAGTTGCTTTAGT GAT AGGGACT GCAGACTCAAGCCT-ACACTG 130 9 

Qy 2 021 GT GCAAAGATT GC C T CT T GAAT T AAAAAAAAA 2 052 

II I I I I I I III I I I I I I I I I I II II 

Db 1310 GTACAAAGACT GCGT CTT GAGATAAACAAGAA 1341 


RESULT 12 

BX463709/c 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 


REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 


BX463709 1025 bp mRNA linear EST 22-MAY-2003 

BX463709 Homo sapiens ADULT BRAIN Homo sapiens cDNA clone 
CS0DN005YN12 3-PRIME, mRNA sequence. 
BX463709 

BX4 63709. 1 GI : 31033554 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1025) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact: Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 5902. r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CS0DN005DG06NPl&cluster=5902. r. Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 


Faraday Avenue Genoscope sequence ID : CSODN005DG06NP1 . 
FEATURES Location/Qualifiers 
source 1. .1025 

/organism="Homo sapiens" 
/mol_type= M mRNA" 
/db_xref="taxon: 9606 M 
/clone="CS0DN005YN12" 
/tissue_type="ADULT BRAIN" 
/ dev_stage="adult" 

/clone_lib="Homo sapiens ADULT BRAIN" 

/note="Organ: brain; Vector: pCMVSPORT_6; 1st strand cDNA 
was primed with a Notl-oligo (dT) primer. Five prime end 
enriched, double-strand cDNA was digested with Not I and 
cloned into the Not I and EcoRV sites of the pCMVSPORT 6 
vector. Library was not normalized." 

BASE COUNT 246 a 249 c 281 g 240 t 9 others 

ORIGIN 

Query Match 45.2%; Score 935.8; DB 13; Length 1025; 

Best Local Similarity 98.1%; Pred. No. 5.4e-112; 

Matches 972; Conservative 5; Mismatches 11; Indels 3; Gaps 3; 

Qy 1008 CCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAA.ee- AGT 1066 

Ml I I I I I I I I I I I I II I I I : I I I I I I I I I I I II M I I I I I I I I I I I I I : I I I I II I 
Db 100 0 CCCCTTGAACATTTTCCCAGTCWTCTCACTCTACCTAATGGGTGAGGTTACMAACCAAGT 941 

Qy 1067 CCT T C C GCAT C ACC - AT C CT T C C GCAGCAAT AC CT GCGG C CAGT G GAAGAT GT GGC C AC G 1125 

I : I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 94 0 CYTCCCGCATCACCAATCCTTCCGCAGCAAATACTGCGGCCAGTGGAAGATGTGGCCACG 881 

Qy 112 6 T - C C CAAGAC GACT GTT ACAAGT T T GC CAT CT CAC AGT CAT C C AC GGG CACT GT TAT GG G 1184 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 8 80 TCC C CAAGAC GACT GT T ACAAGT TT GC CAT CT CAC AGT CAT C C AC GGGC ACT GT T AT GG G 821 

Qy 1185 AGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTT 1244 

M I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I 
Db 820 AGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTT 761 

Qy 1245 TGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTT 1304 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 7 60 TGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTT 701 

Qy 1305 T GT CAC C TT GGACAT GGAAGACT GT G GCT ACAAC ATT C C AC AGACAGAT GAGT CAAC CCT 1364 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I 

Db 7 00 T GT C AC CT T GGACAT GGAAGACT GT GGCT ACAAC ATT CCAC AGACAGAT GAGT CAAC CCT 641 

Qy 1365 CATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCT 1424 

I I I I M I I I I I I I I I M I I I I M I I II I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I 
Db 640 CATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCT 581 

Qy 1425 CAT G GT GT GT CAGT GGC GCT GC CT C CGCTGCCTGCGCCAGCAGCAT GAT GACT TT GCT GA 14 8 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 580 CATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGA 521 

Qy 14 85 T GAC AT C T C C CT GCT GAAGT GAGGAGGC C CAT GGGC AGAAGAT AGAGAT T C C C CT G GACC 154 4 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I II 
Db 520 T GACAT CT C C CT GCT GAAGT GAGGAGGC C CAT GG GCAGAAGAT AGAGAT TCC C CT GGAC C 461 


Qy 

Db 


1545 
460 


1604 
401 


Qy 1605 CAC CT C AGGAC C CT C C C CACC CAC CAAAT GC CT CT GC CTT GAT G GAGAAGGAAAAG G CT G 1664 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 400 CAC C T CAGGAC C CTC C C CACC CAC CAAAT GC C T CT GC CTT GAT GGAGAAGGAAAAG G CT G 341 

Qy 1665 GCAAGGT GGGT T C CAGGGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGC ACT C 1724 

I I I I I I I I I II I I I I I I I I I I I I I I. I I I I I I I I I I I I I I I I | | | | | | | | M | | | | | | | | | 
Db 34 0 GCAAG GT GGGT T C C AG GGACT GTAC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGC ACT C 2 81 

Qy 172 5 TGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAA 1784 

M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II II I I I I I I I II I I I I I II 
Db 280 TGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAA 221 

Qy 1785 CTTCAGCCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCT 1844 

I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I | | | | | || | 
Db 22 0 CTTCAGCCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCT 161 

Qy 184 5 TTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGG 1904 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | 
Db 160 TTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGG 101 

Qy 1905 T AC C CT G GC AGAGAAGAGAC CAAGCT T GT T T C C CT GC TGG C CAAAGT CAGT AG GAGAG GA 1964 

I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | I I I :: I I I I I I I I I I I 
Db 100 T AC C CT GGCAGAGAAGAGAC CAAGCT T GT TT C C CT G CT GGC CAAAVH C CAT AGGAGAGGA 41 

Qy 1965 T GCAC AGT T T GCT AT T T G CT T T AGAGAC AGG 1995 

I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 4 0 TNCACAGCTTGCTATTTGCTMTAGAGACAGG 10 


RESULT 13 

BQ949536 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 


REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 


BQ949536 913 bp mRNA linear EST 21-AUG-2002 

AGENCOURT_8 733256 NIH_MGC_101 Homo sapiens cDNA clone IMAGE: 6454803 
5 1 , mRNA sequence. 
BQ949536 

BQ949536.1 GI:22365014 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 913) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 


found through the I.M.A. G.E. Consortium/ LLNL at: 
http : / /image . llnl . gov 
Plate: LLCM2632 row: h column: 04 
High quality sequence stop: 613. 
FEATURES Location/Qualifiers 
source 1. .913 

/organism="Homo sapiens" 

/mo l_t ype= "mRNA" 

/db_xref="taxon: 9606" 

/ clone-" IMAGE: 6454803" 

/tissue_type="epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/clone_lib="NIH_MGC_101" 

/note="Organ: lung; Vector: pOTB7; Site_l : EcoRI; Site_2: 
Xhol; cDNA made by oligo-dT priming. Directionally cloned 
into EcoRl/XhoI sites using the following 5' adaptor: 
GGCACGAG ( G) . Library constructed by Ling Hong in the 
laboratory of Gerald M. Rubin (University of California, 
Berkeley) using ZAP-cDNA synthesis kit (Stratagene) and 
Superscript II RT (Life Technologies) . Note: this is a 
NIH_MGC Library." 

BASE COUNT 207 a 259 c 232 g 206 t 9 others 

ORIGIN 

Query Match 39.6%; Score 819.6; DB 13; Length 913; 

Best Local Similarity 96.8%; Pred. No. 6e-97; 

Matches 864; Conservative 0; Mismatches 26; Indels 3; Gaps 3; 

Qy 904 T T TGAAGCT G C AGTCAAAT CCAT C AAGGCAGC CT C CT C C AC GGAGAAGT T CCCT GAT GGT 963 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I ! | 
Db 1 T T TGAAG CT GC AGT CAAAT C CAT CAAGGCAGC CT C C T C C AC GGAGAAGTT CC CT GAT GGT 60 

Qy 964 TTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATTTTC 1023 

M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I 
Db 61 T T CT GGC T AG GAGAGC AG C T GGT GT G CT GG CAAGCAGG C AC CAC C C CT T GGAAC AT T T T C 120 

Qy 1024 C C AGT CAT CT CACT CT AC CTAAT GGGT GAG GTT AC CAACCAGT C CT T C C GC AT CAC CAT C 1083 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II | | | | 
Db 121 CCAGTCATCT CACT CTACCTAAT GGGT GAGGTTACCAACCAGTCCTTCCGCATCACCATC 180 

Qy 108 4 CTTC C GC AGCAAT AC CT GC GG C C AGTGGAAGAT GT GGC CAC GT C C CAAGAC GACT GT T AC 1143 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 181 CTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGTTAC 240 

Qy 114 4 AAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAGGGC 1203 

I I I I I I I I I M I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 AAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAGGGC 300 

Qy 1204 TTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGCCAT 1263 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | I 
Db 301 TTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGCCAT 360 

Qy 1264 GTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATGGAA 1323 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I I | | | | | 
Db 361 GTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATGGAA 420 


Qy 


1324 


GACTGT GGCTACAACATT CCACAGACAGATGAGT CAACCCTCAT GACCATAGC CTATGT C 


1383 


Db 


I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

421 GACT GT GGC T ACAACAT T C CAC AGACAGAT GAGT CAACC CT CAT GAC C AT AGC CT AT GT C 48 0 


Qy 138 4 ATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGGCGC 1443 

I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 ATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGGCGC 540 

Qy 14 44 T GC CT C C GC T GC CTGCGCCAGCAG CAT GAT GACT TTGCT GAT GACATCTCCCTGCTGAAG 1503 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I 

Db 541 TGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTGAAG 600 

Qy 1504 T GAGGAGG C C CAT GGGCAGAAGAT AGAGAT T C C C CT G GAC CAC ACCT C C GTGGT T CACTT 1563 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 T GAGGANGC C CAT GN GCAGAAGAT AGAGAT T CC C CT GGAC CAC ACCT C C GT GGT T C ACT T 660 

Qy 1564 T GGT C ACAAGT AGGAGAC AC AGAT GGC AC C T GT GGC C AGAGCAC CT CAGGAC C CT C C C C A 1623 

I I M II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 661 TGGT C ACAAGTAGGAG ACACAGAT GGC AC C CT GT GGC AGAGCAC CT CAGGAC C CT C C C CA 720 

Qy 162 4 CCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCA-GGG 1682 

I I M I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I III 
Db 721 CCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAAGCTGGCAAGGTGNGTTCCAGGGG 7 80 

Qy 168 3 ACT GTACCT GTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCT GCTGGCGGGAATACT C - 1741 

I I I II I I I I Ml I I I I I I I I I I I I I I I II I I I I | | | | | | | | | | | 

Db 7 81 ACT GT AC C CT GT AGGAACNN GANN AGAGAGAAAGAAGC AC T CT GCT G GC GGGAAT ACT CN 84 0 

Qy 1742 TTGGTCACCTCAAATTTAAGTCGGGAAATTCT-GCTGCTTGAAACTTCAGCCC 1793 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml II 
Db 841 TTGGTCACCTCAAATTTAAGTCGGGAAATTCTGGCTGCTTGAAACTCCAGGCC 8 93 


RESULT 14 

BU557282 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 


REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 


Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 


BU557282 928 bp mRNA linear EST 16-SEP-2002 

AGENCOURT_10253167 NIH_MGC_109 Homo sapiens cDNA clone 
IMAGE: 6585124 5', mRNA sequence. 
BU557282 

BU5572 82. 1 GI : 22 90757 8 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 928) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC 
cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consortium/LLNL at: 

http : / / image . llnl . gov 


FEATURES 

source 


BASE COUNT 
ORIGIN 


Plate: LLCM2796 row: j column: 04 
High quality sequence start: 28 
High quality sequence stop: 688. 

Location/Qualifiers 

1. .928 

/organism="Homo sapiens" 
/mol_t ype= "mRNA" 
/db_xref="taxon: 9606" 
/clone- "IMAGE: 6585124" 

/tissue_type="teratocarcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/clone_lib="NIH_MGC_109" 
/note^"Organ: ovary; Vector: pOTB7 ; 
Xhol; cDNA made by oligo-dT priming 
into EcoRI/XhoI sites using the following 5' adaptor: 
GGCACGAG ( G) . Library constructed by Ling Hong in the 
laboratory of Gerald M. Rubin (University of California, 
Berkeley) using ZAP-cDNA synthesis kit (Stratagene) and 
Superscript II RT (Life Technologies) . Note: this is a 
NIH_MGC Library. " 
205 a 254 c 251 g 217 t 1 others 


Site_l: EcoRI; Site_2 : 
Directionally cloned 


Query Match 38.2%; Score 791.2; DB 13; Length 928; 

Best Local Similarity 98.1%; Pred. No. 2.8e-93; 

Matches 822; Conservative 0; Mismatches 13; Indels 3; 


Gaps 


2; 


Qy 


Db 


452 GCCCCAACGTCACTGTGCGTGCCAACATTGCTGCCATCACTGAATCAGACAAGTTCTTCA 511 
I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
34 GGCCCAACGTCACTGTGCGTGCCAACATTGCTGCCATCACTGAATCAGACAAGTTCTTCA 93 


Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 


512 TCAACGGCTCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTG 571 
I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
94 TCAACGGCTCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTG 153 

572 ACGACTCCCTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCT 631 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
154 ACGACTCCCTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCT 213 

632 TCTCCCTGCACCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTG 691 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
214 TCTCCCTGCAGCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTG 273 

692 T C G GAG GGAGCAT GAT CAT T GGAGGT AT C GAC CACT C GCT GT AC ACAGG C AGT CT CT GGT 751 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | 
274 TCGGAGGGAGCAT GAT CAT TGGAGGTATC GAC CACT C GCT GTACACAGGCAGTCTCT GGT 333 

752 AT AC AC C CAT C C GG C G GGAGT GGT AT TAT GAGGT CAT CAT T GT GC G G GT GGAGAT CAAT G 811 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

334 AT AC AC C CAT C C GGC GGGAGT GGTATT AT GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT G 393 

812 GACAGGAT CT GAAAAT GGACTGCAAGGAGT ACAACTAT GACAAGAGCATTGT GGACAGT G 871 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I | I 
394 GAC AGGATCTGAAAATGGACT GCAAGGAGTACAACTAT GACAAGAGCATTGTGGACAGT G 4 53 


Qy 


872 GCAC C AC CAAC CT T C GT TT GC C CAAGAAAGT GTT T GAAGCT G C AGT CAAAT C CAT CAAG G 931 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I II I I I I I I I I I I 


Db 


4 54 GCAC C ACCAAC CTTCGTTTGCC CAAGAAAGT GTT T GAAG CT GC AGT CAAAT C CAT CAAGG 513 


Qy 932 CAGCCTCCTCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCT 991 

M I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | M | | | 
Db 514 CAGCCTCCTCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCT 573 

Qy 992 GGCAAGCAGGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTG 1051 

I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | M I I I I I 

Db 574 GGCAAGCAGGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTG 633 

Qy 1052 AGGTTACCAACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGG 1111 

I I I I I I M I 1 I I I I I I I I I I I I II I I I I I I I I I I I I I I | M I I I I I I I I I M I I I I I I I I 
Db 634 AGGTTACCAACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGG 693 

Qy 1112 AAGAT GT GG C CAC GT CC CAAGAC GACTGT T ACAAGT TT GCCAT CT C AC AGT CAT C CAC GG 1171 

I I I I M I II I M I I I I I II I II I I I I I I I I I I I I I I I I I II | || | | | | | || | | | | | | | || 
Db 694 AAGAT GT G GC CAC GT C C CAAGAC GACT GT T ACAAGT T T GC CAT CT CAC AGT CAT C CAC GG 753 

Qy 1172 GCACTGTTATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCC — G 1229 

I M I I I I I I II I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 754 GCACTGTTATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCCGA 813 

Qy 1230 AAAACGAATTGGCTTTGCTGTCA-GCGCTTGCCATGTGCACGATGAGTTCAGGACGGC 1286 

Ml I I I I I I I I I I I I M I I I I I I I I I I I I I I M I II II I I I I I M II 
Db 814 AAAC GAAT T G GG CTT T G C T GT CAG GCGCT T GC CAT GT GC AC GAT GAAGT CAGGGAC GC 871 


RESULT 15 

AL570757/c 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 


REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 


FEATURES 


AL570757 858 bp mRNA linear EST 31-MAY-2003 

AL570757 Homo sapiens PLACENTA COT 25-NORMALIZED Homo sapiens cDNA 
clone CS0DI022YF22 3-PRIME, mRNA sequence. 
AL570757 

AL5707 57.2 GI : 312 92174 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 858) 

Li,W.B., Gruber,C, Jessee, J. and Polayes, D. 
Full-length cDNA libraries and normalization 
Unpublished 

On Feb 16, 2001 this sequence version replaced gi : 12927378. 
Contact: Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 5902. r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CS0DI022DCHNPl&cluster=5902 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODI 022DC11NP1 . 
Location/ Qualifiers 


source 1. .858 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="CS0DI022YF22" 

/tissue_type="PLACENTA COT 25-NORMALIZED" 
/clone_lib="Homo sapiens PLACENTA COT 25-NORMALIZED" 
/note="lst strand cDNA was primed with a Notl-oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 

BASE COUNT 214 a 205 c 224 g 200 t 15 others 

ORIGIN 

Query Match 38.0%; Score 787.4; DB 9; Length 858; 

Best Local Similarity 97.2%; Pred. No. 9e-93; 

Matches 824; Conservative 5; Mismatches 16; Indels 3; Gaps 3; 

Qy 114 8 TTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAGGGCTTCT 1207 

M : I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 845 TTSCCATCTCACATTCATCCAC-GGCACTGTTAT-GGAGCTGTTATCAT-GARGGCTTCT 789 

Qy 1208 ACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGCCATGTGC 1267 

II I I I I I I I I II I I I I I II 1111111111111:11 I I I I I I I II I I I I I I I I I 

Db 7 88 ACWTTGTCTTTGATCGGGCCRGAAAACGAATTGGYTTGCTRTTCAGCGCTTGCCATGTGC 729 

Qy 1268 AC GAT GAGTT CAGGAC G G C AGC G GT GGAAGGC CCT T TT GT C AC CT T G GACAT G GAAGACT 1327 

I I I I I I M II I I II II I I II I I I I II I I I II I I I I I I I I | | | | : | | | I I I I I I I I I II I 
Db 72 8 AC GAT GAGT T CAGGAC GGCAGC GGT GGAAG GC C CT T TT GT C AC YT T G YACAT G GAAGACT 669 

Qy 132 8 GT GG CTACAACAT T C CACAGACAGAT GAGT CAAC C CT C AT GAC CAT AGC CT AT GT CAT GG 1387 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I | 
Db 668 GT GGCTACAACATT CCACAGACAGAT GAGT CAAC CCTCAT GACCATAGCCT AT GT CAT GG 609 

Qy 1388 CTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGGCGCTGCC 14 47 

I I I I I I I I I I I I I I I I I I I I I I I I I I I | M | | | | | | | | | | | | | | | | | | | | | | | | | | || 
Db 608 CTGCCATNTGCGCCCTCTTNATGCTGCCACTCTGCCTCATGGTGTGTCAGTGGCGCTGCC 54 9 

Qy 144 8 TCCGCTGCCT GCGC CAGCAGCAT GAT GACTT T GCT GAT GAC ATCT C CCT GCT GAAGT GAG 1507 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 54 8 TCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTGAAGTGAG 48 9 

Qy 1508 GAGGC CCAT GGGCAGAAGAT AGAGAT T C C CCT GGAC CAC AC CTC C GT G GT T CACT TT GGT 1567 

I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | 
Db 488 GAGGCC CAT GGGCAGAAGAT AGAGAT T C C CCT GGAC CAC AC C T C C GT GGT T CACT T T GGT 42 9 

Qy 1568 CACAAGT AGGAGACAC AGAT G G CAC C T GT GGC CAGAG C AC CT CAGGAC C C T C C C CAC C C A 1627 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 428 CACAAGT AGGAGACACAGAT GGCACCT GT GGCCAGAGCACCT CAGGACCCT C CCCACC CA 369 

Qy 1628 CCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAGGGACTGT 1687 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I | I | 
Db 368 CCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAGGGACTGT 309 

Qy 1688 AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAGCACTCT GCT GGC GG GAAT ACT CTT GGT C 174 7 

M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 308 ACCT GT AGGAAAC AGAAAAGAGAAGAAAGAAGCACTCT GCT GGCGGGAATACT CTT GGTC 24 9 


Qy 174 8 ACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCTTTGTCCA 18 07 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II 
Db 24 8 ACCTCT^AATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCTTTGTCCA 18 9 

Qy 1808 CCATTCCTTTAAATTCTCC7VA.CCCAAAGTATTCTTCTTTTCTTAGTTTCAGAAGTACTGG 1867 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 18 8 CCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAAGTACTGG 12 9 

Qy 1868 CAT CACAC GCAGGT T AC CTT G GC GT GT GT CCCT GT GGT AC C CT G G C AGAGAAGAGAC CAA 1927 

I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I | I I | | | | | | | : | | | | 
Db 12 8 CATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAGAGWCCAA 69 

Qy 1928 GCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATTTGCTTTA 1987 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 68 GCTTGTTTCCCTGCTNNCCAAATTCAGTAGGAGAGGATGCACJ\NTTTGCTATTTCCTTTA 9 

Qy 1988 GAGACAGG 1995 

I I I I I I I I 
Db 8 GAGACAGG 1 


Search completed: January 
Job time : 4456 sees 


21, 2004, 14:43:37 


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 

OM nucleic - nucleic search, using sw model 

Run on: January 21 , 2004, 09:28:36 ; Search time 7650 Seconds 

(without alignments) 
11069.664 Million cell updates/sec 

Title: US-09-869-414A-3 
Perfect score: 2070 

Sequence: 1 atggcccaagccctgccctg aaaaaaaaaaaaaaaaaaaa 2070 

Scoring table: IDENTITY JMUC 

Gapop 10.0 , Gapext 1.0 

Searched: 288871 1 seqs, 20454813386 residues 

Total number of hits satisfying chosen parameters: 5777422 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : GenEmbl:* 


1: 

gb_ba:* 

2: 

gb_htg:* 

3: 

gbjn:* 

4: 

gb_om:* 

5: 

gb_pv:* 

6: 

gb_pat:* 

7: 

gb_ph:* 


8: gbj>l:* 

9: gb_pr:* 

10: gb_ro:* 

11: gb_sts:* 

12: gb_sy:* 

13: gb„un:* 

14: gb_vi:* 

15: em_ba:* 

16: em_fun:* 

17: em_hum:* 

18: em_in:* 

19: em_mu:* 

20: em_om:* 

21: em_or:* 

22: em_ov:* 

23: em„pat:* 

24: em_ph:* 

25: em_pl:* 

26: em_ro:* 

27: em_sts:* 

28: em_un:* 

29: em_vi:* 

30: em_htg_hum:* 

31: em_htg_inv:* 

32: em_htg_other:* 

33: em_htg_mus:* 

34: em_htg„pln:* 

35: em_htg__rod:* 

36: em_htg_mam:* 

37: em_htg_vrt:* 

38: em_sy:* 

39: em_htgo_hum:* 

40: emj-itgojmus:* 

41: em„htgo_other:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 


and is derived by analysis of the total score distribution. 
SUMMARIES 

% 

Result Query 
No. Score Match Length DB ID Description 


1 

2070 

100.0 

2070 

6 AX1 05385 

AX1 05385 Sequence 

2 

2070 

100.0 

2070 

6 AX378015 

AX3780 15 Sequence 

3 

2070 

100.0 

2070 

6 AX573823 

AX573823 Sequence 

4 

2068.4 

99.9 

2070 

6 AR224093 

AR224093 Sequence 

5 

2068.4 

99.9 

2070 

6 AR269224 

AR269224 Sequence 

6 

2068.4 

99.9 

2070 

6 AX700447 

AX700447 Sequence 

7 

2068.4 

99.9 

2070 

9 AF200343 

AF200343 Homo sapi 

8 

2062 

99.6 

2526 

6 AX700446 

AX700446 Sequence 

9 

2062 

99.6 

2526 9 AF1 90725 

AF 190725 Homo sapi 

10 

2052.6 

99.2 

CQ~7Q 
OO/O 

y ArzU14bo 

AF201468 Homo sapi 

11 

2040 

98.6 

5814 

9 AB032975 

AB032975 Homo sapi 

12 

2032 

98.2 

2541 

6 AR1 78469 

AR1 78469 Sequence 

13 

2032 

98.2 

2541 

6 AX002655 

AX002655 Sequence 

14 

2032 

98.2 

2541 

6 AX700452 

AX700452 Sequence 

15 

2032 

98.2 

2541 

6 E50816 

E5081 6 Aspartate p 

16 

2010 

97.1 

3252 

6 AR305033 

AR305033 Sequence 

17 

2010 

97.1 

3252 

6 AX062111 

AX062111 Sequence 

18 

2010 

97.1 

3252 

6 AX063201 

AX063201 Sequence 

19 

2010 

97.1 

3252 

6 AX472368 

AX472368 Sequence 

20 

2010 

97.1 

3252 

6 AX700453 

AX700453 Sequence 

21 

2010 

97.1 

3252 

9 AF200193 

AF200193 Homo sapi 

22 

1894.2 

91.5 

5757 

6 AX364933 

AX364933 Sequence 

23 

1887.2 

91.2 

1977 

6 AR224094 

AR224094 Sequence 

24 

1887.2 

91.2 

1977 

6 AR269225 

AR269225 Sequence 

25 

1887.2 

91.2 

1977 

6 AX1 05387 

AX1 05387 Sequence 

26 

1887.2 

91.2 

1977 

6 AX378017 

AX378017 Sequence 

27 

1887.2 

91.2 

1977 

6 AX573825 

AX573825 Sequence 

28 

1843.4 

89.1 

2370 

6 AR1 78470 

AR1 78470 Sequence 

29 

1843.4 

89.1 

2370 

6 AX002657 

AX002657 Sequence 

30 

1843.4 

89.1 

2370 

6 E50817 

E5081 7 Aspartate p 


31 1788.8 86.4 1979 6 BD127853 

32 1788.8 86.4 1979 9 AK075049 

33 1706.8 82.5 2174 9 BC036084 

34 1655.8 80.0 1746 6 AX616339 

35 1644.8 79.5 1747 6 BD139308 

36 1554 75.1 4059 10 BC048189 

37 1552.4 75.0 2043 6 AR224095 

38 1552.4 75.0 2043 6 AR269226 

39 1552.4 75.0 2043 6 AX1 05389 

40 1552.4 75.0 2043 6 AX573827 

41 1552.4 75.0 2043 10 AF200346 

42 1548.4 74.8 3880 10 AF1 90726 

43 1502.8 72.6 1506 6 AX700448 

44 1502.8 72.6 1506 9 AF204943 

45 1499.8 72.5 1527 6 BD103723 


BD1 27853 Primer fo 
AK075049 Homo sapi 
BC036084 Homo sapi 
AX616339 Sequence 
BD1 39308 Extended 
BC048189Mus muscu 
AR224095 Sequence 
AR269226 Sequence 
AX1 05389 Sequence 
AX573827 Sequence 
AF200346 Mus muscu 
AF 190726 Mus muscu 
AX700448 Sequence 
AF204943 Homo sapi 
BD1 03723 Beta seer 


ALIGNMENTS 


RESULT 1 
AX1 05385 

LOCUS AX105385 2070 bp DNA linear PAT 30-APR-2001 

DEFINITION Sequence 3 from Patent WO0123533. 

ACCESSION AX105385 

VERSION AX105385.1 GM3921511 

KEYWORDS . 

SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Gurney,M. and Bienkowski.M.J. 

TITLE Alzheimer's disease secretase, app substrates therefor, and uses 
therefor 

JOURNAL Patent: WO 0123533-A 3 05-APR-2001 ; 
Pharmacia & Upjohn Company (US) 


FEATURES Location/Qualifiers 
source 1. .2070 

/organism-'Homo sapiens" 

/mol_type="genomic DNA" 

/db_xref="taxon:9606" 
BASE COUNT 476 a 583 c 562 g 449 1 
ORIGIN 

Query Match 100.0%; Score 2070; DB 6; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

lllllllillllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 


Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT480 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC660 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC660 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 


Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

lllllllllllllllllllllllllllllllllllllllllllillllllllllllllll 
Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllilllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 


Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA1560 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllillll 


Db 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA 1560 

Qy 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

Qy 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

llllllllllilllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

Qy 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

Qy 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 



Db 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 


Qy 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

Qy 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllillll 
Db 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

Qy 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

Qy 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

llllllllllllllllllllllllllllllllllllllllllilllllllllllllllll 
Db 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 


Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

llllllllllllllllllllllllllllll 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 2 
AX378015 

LOCUS AX378015 2070 bp DNA linear PAT 18-MAR-2002 

DEFINITION Sequence 1 from Patent WO0206306. 

ACCESSION AX378015 

VERSION AX378015.1 Gl: 19574050 

KEYWORDS . 

SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Yan.R., Tomasselli.A.G., Gurney.M.E., Emmons.TL- Bienkowski.M.J. 

and Heinrikson,R.L. 
TITLE Substrates and assays for _g(b)-secretase activity 
JOURNAL Patent: WO 0206306-A 1 24-JAN-2002; 
PHARMACIA & UPJOHN COMPANY (US) 
FEATURES Location/Qualifiers 
source 1. .2070 

/organism- 'Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref="taxon:9606" 
BASE COUNT 476 a 583 c 562 g 449 1 
ORIGIN 

Query Match 100.0%; Score 2070; DB 6; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 


Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 1 80 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 



Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 


Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

lllllllllllllillllllllllllllllllllllllllllllllllllllllllllll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

lllllllllllllllllllllllllllllllllllllllllllllllilllllillllll 
Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT1140 

lllllllllllllllllllllllllllllllllllllllllllllllllillllllllll 
Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT1140 


Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 

GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

lllllllllllllllllllllllillllllllllllllllllllllllllllllllllll 
Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA 1560 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA 1560 

Qy 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

lllllllllllllllllllllillllllllllllllllllllllllllllllllllllll 
Db 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

Qy 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 


Qy 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 

TACT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

Qy 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

llllllllllllllilllllllllllllllllllllllllllllllllllllllllllll 
Db 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

Qy 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

Qy 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

Qy 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

Qy 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

llllllllllllllllllllllllllllll 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 3 
AX573823 

LOCUS AX573823 2070 bp DNA linear PAT 07-JAN-2003 

DEFINITION Sequence 3 from Patent EP1249498. 

ACCESSION AX573823 

VERSION AX573823.1 GL27551477 

KEYWORDS . 

SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Gurney.M. and Bienkowski.M.J. 

TITLE Alzheimer's disease secretase, app substrates therefor, and uses 
therefor 

JOURNAL Patent: EP 1249498-A 3 16-OCT-2002; 
PHARMACIA & UPJOHN COMPANY (US) 
FEATURES Location/Qualifiers 
source 1. .2070 

/organism-'Homo sapiens" 


/mol_type="genomic DNA" 

/db_xref="taxon:9606" 
BASE COUNT 476 a 583 c 562 g 449 1 
ORIGIN 

Query Match 100.0%; Score 2070; DB6; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 

AGCTTT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 

TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 


Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC660 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 


Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA1560 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA1560 


Qy 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

lllllllllllllllllillllllllllllllllllllllllllllllllllllllllll 
Db 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

Qy 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

Qy 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

Qy 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

Qy 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

Qy 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

Qy 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

Qy 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

llllllllllllllllllllllllllllll 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 4 
AR224093 

LOCUS AR224093 2070 bp DNA linear PAT 26-SEP-2002 

DEFINITION Sequence 3 from patent US 6440698. 


ACCESSION AR224093 
VERSION AR224093.1 Gl:23332753 
KEYWORDS . 
SOURCE Unknown. 
ORGANISM Unknown. 
Unclassified. 
REFERENCE 1 (bases 1 to 2070) 
AUTHORS Gurney.M.E., Bienkowski.M.J., Heinrikson.R.L, Parodi.L.A. and 
Yan.R. 

TITLE Alzheimer's disease secretase, APP substrates therefor, and uses 
therefor 

JOURNAL Patent: US 6440698-A 3 27-AUG-2002; 
FEATURES Location/Qualifiers 
source 1. .2070 

/organism="unknown" 
BASE COUNT 476 a 582 c 563 g 449 1 
ORIGIN 

Query Match 99.9%; Score 2068.4; DB 6; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2069; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 


Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 

CATT 480 

llllllllllllllllllllllllllllllllilllllllllllllllllllllllllll 
Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC 660 


Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 

TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 

TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

illlllllllllllllllllllllllllllllllllllllilllllllllllllllllll 
Db 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

lllllllllllllllllllllllllllllillllllllllllllllllllllllllllll 


Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 

CATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllilllllll 
Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllilllllllllllllllllllll 
Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 

CTGT 1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 


Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 

TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 

CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllilllllllll 
Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 

TTCA1560 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA1560 

Qy 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

Qy 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 

CCAG 1680 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

Qy 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 


Qy 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

Qy 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

lllllllllllllllllllllllllllllllllllllllillllllllllllllllllll 
Db 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

Qy 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

Qy 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

lllllllllllllllllllllllllllllllllllllllllllllllllllllillllll 
Db 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

Qy 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

llllllllllllllllllllllllllllll 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 5 
AR269224 

LOCUS AR269224 2070 bp DNA linear PAT 10-APR-2003 

DEFINITION Sequence 3 from patent US 6500667. 

ACCESSION AR269224 

VERSION AR269224.1 Gl:29700192 

KEYWORDS . 

SOURCE Unknown. 

ORGANISM Unknown. 
Unclassified. 
REFERENCE 1 (bases 1 to 2070) 

AUTHORS Gurney.M.E., Bienkowski.M.J., Heinrikson.R.L., Parodi.LA. and 
Yan.R. 

TITLE Aspartyl protease 2 (Asp2) antisense oligonucleotides 
JOURNAL Patent: US 6500667-A 3 31-DEC-2002; 
FEATURES Location/Qualifiers 
source 1. .2070 

/organism="unknown" 
BASE COUNT 476 a 582 c 563 g 449 1 
ORIGIN 

Query Match 99.9%; Score 2068.4; DB 6; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2069; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 


Illlllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 1 80 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 


Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 

CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 

GGAG 420 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 

CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 

CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC 660 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 

TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 


Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

lllllllllillllllllllllllllllllllllllllllllllllllllllllllllll 
Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT1140 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 


Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA1560 

llllllllllllilllllllllllllllllllllllllllllllllllllllllllllll 
Db 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA1560 

Qy 1561 

CTTTGGTCACAAGTAG GAG ACACAGATGG CACCTGTGG CCAG AGCACCTCAGG ACC 
CTCC 1620 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

Qy 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

Qy 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

Qy 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

Qy 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

Qy 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 


Qy 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

Qy 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

llllllllllllllllllllllllllllll 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 6 
AX700447 

LOCUS AX700447 2070 bp DNA linear PAT 03-APR-2003 

DEFINITION Sequence 2 from Patent WO03012089. 

ACCESSION AX700447 

VERSION AX7Q0447.1 Gl:29536238 

KEYWORDS . 

SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Yon.J., Cleasby.A., Bruinzeel.W.D., Masure.S.L, Tickle.l. and 
Sharff,A. 

TITLE Crystal structure of beta-site app cleaving enzyme (bace) and use 
thereof 


JOURNAL Patent: WO 03012089-A 2 13-FEB-2003; 

Astex Technology Limited (GB) ; JANSSEN PHARMACEUTICA N.V. (BE) 
FEATURES Location/Qualifiers 
source 1. .2070 

/organism-'Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref="taxon:9606" 
BASE COUNT 476 a 582 c 563 g 449 1 
ORIGIN 

Query Match 99.9%; Score 2068.4; DB 6; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2069; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllilllllllllllllllllllllllllllllllllllllllllll 
Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

lillllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 


Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll! 
Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC 660 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 


Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1 140 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 


Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

lllllllillllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA1560 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA 1560 

Qy 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

Qy 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

Qy 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

Qy 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

llllllllilllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 


Qy 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

Qy 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

Qy 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

Qy 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 


Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

llllllllllllllllllllllllllllll 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 7 
AF200343 

LOCUS AF200343 2070 bp mRNA linear PR! 12-DEC-1999 

DEFINITION Homo sapiens chromosome 1 1 aspartyl protease 2 mRNA, complete cds. 

ACCESSION AF200343 

VERSION AF200343.1 Gl:6561813 

KEYWORDS . 

SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 2070) 
AUTHORS Yan,R., Bienkowski.MJ., Shuck,M.E., Miao,H., Tory.M.C., 

Pauley,A.M., Brashier.J.R., Stratman,N.C., Mathews,W.R., Buhl.A.E., 
Carter,D.B., Tomasselli,A.G., Parodi,LA, Heinrikson.R.L and 
Gurney,M.E. 

TITLE Membrane-anchored aspartyl protease with Alzheimer's disease 

beta-secretase activity 
JOURNAL Nature 402 (6761), 533-537 (1999) 
MEDLINE 20057170 
PUBMED 10591213 
REFERENCE 2 (bases 1 to 2070) 

AUTHORS Bienkowski.MJ., Shuck,M.E., Slightom.J.L and Drong,R.F. 
TITLE Direct Submission 

JOURNAL Submitted (29-OCT-1999) Genomics Research, Pharmacia&Upjohn, 301 
Henrietta, Kalamazoo, Ml 49007, USA 
FEATURES Location/Qualifiers 
source 1. .2070 

/organism- 'Homo sapiens" 
/moLtype="mRNA" 
/db_xref="taxon:9606" 
/chromosome-' 11" 
/map= M 11q23.3-24.1" 
CDS 1..1506 

/codon_start=1 

/product- 'aspartyl protease 2" 
/protein_id="AAF 1 7079. 1 " 


/db_xref="GI:6561814" 

/translation="MAQALPWLLLWMGAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLP 

RETDEEPEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVG 

AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQG KWEGELGTDLVSI PHGPNVTVRANI 

AAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGA 

GFPLNQSEVLASVGGSM I IGGI DHSLYTGSLWYTPIRREWYYEVII VRVE I NGQDLKM 

DCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAG 

TTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGT 

VMGAVIMEGFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 

DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLLK" 
BASE COUNT 476 a 582 c 563 g 449 1 
ORIGIN 

Query Match 99.9%; Score 2068.4; DB 9; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2069; Conservative 0; Mismatches 1 ; Indels 0; Gaps 0; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 1 80 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

. Illlllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 


Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

lllllllllllllllllllllllllllllllllllllllllllilllllllllllllll 


Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC 660 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllillll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 


Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

lllllllllllllllllllllllllllllllllllllllllillllllllllllllllll 
Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 


Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA 1560 

lllllllllllllllllllllllllllllllllllllllllllllllllillllllllll 
Db 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA 1560 

Qy 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHiillllllllllllllllll 
Db 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

Qy 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

Qy 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

Qy 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

lllllllllllllllllllllllllllllllillllllllllllllllllllllllllll 
Db 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

Qy 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

llllllllllllllllllllllllllllllllllllllilllllllllllllllllllll 
Db 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

Qy 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

Qy 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 


Qy 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

llllllllllllllllllllllllllllll 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 8 
AX700446 

LOCUS AX700446 2526 bp DNA linear PAT 03-APR-2003 

DEFINITION Sequence 1 from Patent WO030 12089. 

ACCESSION AX700446 

VERSION AX700446.1 Gl:29536237 

KEYWORDS . 

SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Yon.J., CleasbyA, Bruinzeel.W.D., Masure.S.L, Tickle.l. and 
Sharff,A. 

TITLE Crystal structure of beta-site app cleaving enzyme (bace) and use 
thereof 

JOURNAL Patent: WO 03012089-A 1 13-FEB-2003; 

Astex Technology Limited (GB) ; JANSSEN PHARMACEUTICA N.V. (BE) 
FEATURES Location/Qualifiers 
source 1. .2526 

/organism="Homo sapiens" 
/moljype-'genomic DNA" 
/db_xref="taxon:9606" 
BASE COUNT 534 a 781c 715 g 496 1 


ORIGIN 


Query Match 99.6%; Score 2062; DB6; Length 2526; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2065; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 454 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 513 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 514 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 573 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 574 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 633 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllilllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 634 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 693 


Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 694 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 753 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 754 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 813 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 814 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 873 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 874 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 933 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 934 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 993 


Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 994 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 1053 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1054 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC 1113 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1114 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 1173 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1174 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 1233 


Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1234 

GAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 1293 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1294 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 1353 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1354 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 1413 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1414 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1473 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1474 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1533 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1534 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1593 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1594 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1653 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1654 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1713 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1714 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1773 


Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1774 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1833 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1834 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1893 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1894 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1953 

Qy 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA 1560 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1954 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA 2013 

Qy 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 2014 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 2073 

Qy 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2074 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 2133 

Qy 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2134 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 2193 

Qy 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2194 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 2253 

Qy 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2254 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
2313 


Qy 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2314 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 2373 

Qy 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2374 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 2433 

Qy 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2434 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2493 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

llllllllllllll I llllllllllll 
Db 2494 ATTAAAAAAAAAAACTAGAAAAAAAAAAAA 2523 


RESULT 9 
AF 190725 

LOCUS AF190725 2526 bp mRNA linear PRI 26-OCT-1 999 

DEFINITION Homo sapiens beta-site APP cleaving enzyme (BACE) mRNA, complete 
cds. 

ACCESSION AF1 90725 

VERSION AF190725.1 Gl:61 18538 

KEYWORDS . 


SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 2526) 

AUTHORS Vassar,R., Bennett.B.D., Babu-Khan.S., Kahn,S., Mendiaz,E.A., 

Denis.P., Teplow,D.B„ Ross,S., Amarante,P., Loeloff,R., Luo,Y., 

Fisher,S., Fuller.J., Edenson.S., Lile,J., Jarosinski,MA, 

Biere.A.L, Curran.E., Burgess,T., Louis,J.C., Collins, F., 

Treanor,J., Rogers.G. and Citron, M. 
TITLE Beta-secretase cleavage of Alzheimer's amyloid precursor protein by 

the transmembrane aspartic protease BACE 
JOURNAL Science 286 (5440), 735-741 (1999) 
MEDLINE 20002972 
PUBMED 10531052 
REFERENCE 2 (bases 1 to 2526) 
AUTHORS Bennett,B D., Vassar.R. and Citron,M. 
TITLE Direct Submission 

JOURNAL Submitted (29-SEP-1999) Neuroscience, Amgen Inc., One Amgen Center 
Dr., Thousand Oaks, CA 91320-1799, USA 
FEATURES Location/Qualifiers 
source 1. .2526 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606 n 
gene 1 . .2526 

/gene="BACE" 
CDS 454. .1959 

/gene-"BACE H 
/codon_start=1 

/product- 'beta-site APP cleaving enzyme" 
/protein_id="AAF04142.r 
/db_xref="GI:61 18539" 

/translation="MAQALPWLLLWMGAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLP 

RETDEEPEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVG 


AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANI 

AAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGA 

GFPLNQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKM 

DCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAG 

TTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGT 

VMGAVIMEGFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 

DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLLK' 
BASE COUNT 534 a 781 c 715 g 496 1 
ORIGIN 

Query Match 99.6%; Score 2062; DB 9; Length 2526; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2065; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 454 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 513 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 514 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 573 


r 


Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 574 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 633 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 634 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 693 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 694 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 753 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 754 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 813 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIII 


Db 814 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 873 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 874 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 933 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 934 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 993 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 994 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 1053 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

lllllllllllllllllllllllllllllllllllllllll llllllllllllllllll 
Db 1054 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC 1113 


Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1114 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 1173 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1174 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 1233 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1234 

GAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 1293 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1294 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 1353 

Qy 901 

GTGTTTG AAGCTG CAGTCAAATCCATCAAGGCAG CCTCCTCCACGG AG AAGTTCCCT 
GAT 960 

lllllllllllllllllllllllllillllllllllllllllllllllllllllllllll 


Db 1354 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 1413 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1414 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1473 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1474 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1533 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1534 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1593 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1594 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1653 


Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1654 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1713 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1714 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1773 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1774 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1833 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1834 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1893 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1894 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1953 

Qy 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA1560 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1954 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA2013 

Qy 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2014 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 2073 

Qy 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 1680 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2074 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 2133 

Qy 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2134 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 2193 


Qy 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2194 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 2253 

Qy 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2254 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
2313 

Qy 1861 

GTACTGG CATCACACGCAGGTTACCTTG GCGTGTGTCCCTGTGGTACCCTG GCAG A 
GAAG 1920 

llllllllllllllllllllllllllflllllllllllllllllllllilllllllllll 
Db 2314 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 2373 

Qy 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2374 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 2433 

Qy 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 2434 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2493 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

llllllllllllll lllllllllllll 
Db 2494 ATTAAAAAAAAAAACTAGAAAAAAAAAAAA 2523 


RESULT 10 
AF201468 

LOCUS AF201468 5878 bp mRNA linear PRI 19-DEC-1999 

DEFINITION Homo sapiens APP beta-secretase mRNA, complete cds. 

ACCESSION AF201468 

VERSION AF201468.1 Gl:6601444 

KEYWORDS . 

SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

REFERENCE 1 (bases 1 to 5878) 
AUTHORS Sinha.S., Anderson.J.P., Barbour.R., Basi,G.S., CaccaveIlo,R., 
Davis.D., Doan.M., Dovey.H.F., Frigon.N., Hong, J., 
Jacobson-Croak,K. t Jewett,N., Keim,P., Knops,J., LieberburgJ., 
Power,M., Tan,H., Tatsuno.G., Tung.J., Schenk,D., Seubert,P., 
Suomensaari,S., Wang,S., Walker,D., Zhao,J., McConlogue,L. and 
John,V. 

TITLE Purification and cloning of amyloid precursor protein 

beta-secretase from human brain 
JOURNAL Nature 402 (6761), 537-540 (1999) 
MEDLINE 20057171 
PUBMED 10591214 
REFERENCE 2 (bases 1 to 5878) 
AUTHORS Basi.G.S., Power.M.D., Wang,S., Tatsuno,G., Frigon,N., Doan,M., 

Hong.G., Keim,P., Anderson,J., Sinha,S. and McConlogue.L.M. 
TITLE Direct Submission 

JOURNAL Submitted (03-NOV-1999) Gene Expression Group, Elan 


Pharmaceuticals, Inc., 800 Gateway Blvd., S. San Francisco, CA 
94080, USA 
FEATURES Location/Qualifiers 
source 1. .5878 

/organism="Homo sapiens" 
/moUype-'mRNA" 
/db_xref="taxon:9606" 
/chromosome-' 1 1" 
/map="D11S4352" 
/tissue_type="brain" 
CDS 455. .1960 

/function-'cleaves APP at the beta-cleavage site" 
/note="membrane type aspartyl protease" 
/codon_start=1 

/product="APP beta-secretase" 
/protein_id="AAF1 8982. 1 " 
/db_xref="GI:6601445" 

/translation="MAQALPWLLLWMGAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLP 

RETDEEPEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVG 

AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANI 

AAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGA 

GFPLNQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKM 

DCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAG 

TTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGT 

VMGAVIMEGFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 

DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLLK" 
BASE COUNT 1547 a 1518 c 1405 g 1408 1 
ORIGIN 


Query Match 99.2%; Score 2052.6; DB 9; Length 5878; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2055; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 455 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 514 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 515 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 574 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 575 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 634 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 635 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 694 


Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 695 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 754 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 755 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 814 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 815 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 874 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 875 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 934 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 935 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 994 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 995 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 1054 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC660 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1055 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC1114 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

llllllllllllllllllllllllilllllllllllllllllllllllllllllllllll 
Db 1115 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 1174 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

llllllllllllllllllllllllllllllllllllllilllllllllllllllllllll 
Db 1175 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 1234 


Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 

GGAG 840 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1235 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 1294 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1295 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 1354 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1355 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 1414 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
C ATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1415 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1474 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1475 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1534 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1535 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1594 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1595 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1654 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1655 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1714 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1715 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1774 


Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 

TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1775 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1834 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1835 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1894 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1895 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1954 

Qy 1501 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA1560 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1955 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGG 
TTCA2014 

Qy 1561 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 1620 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 2015 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACC 
CTCC 2074 

Qy 1621 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 

CCAG 1680 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2075 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTT 
CCAG 2134 

Qy 1681 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2135 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAA 
TACT 2194 

Qy 1741 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT 1800 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2195 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 
CT2254 

Qy 1801 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
1860 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2255 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 
2314 


Qy 1861 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 1920 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2315 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAAG 2374 

Qy 1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 1980 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2375 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCT 
ATT 2434 

Qy 1981 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2040 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2435 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 
GA 2494 

Qy 2041 ATTAAAAAAAAAAAAAAAA 2059 
llllllllllllll II 

Db 2495 ATTAAAAAAAAAAACTAGA 251 3 


RESULT 11 
AB032975 

LOCUS AB032975 5814 bp mRNA linear PRI 17-MAY-2001 

DEFINITION Homo sapiens mRNA for KIAA1 149 protein, partial cds. 

ACCESSION AB032975 

VERSION AB032975.2 Gl: 141 33242 

KEYWORDS . 

SOURCE Homo sapiens (human) 


ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (sites) 

AUTHORS Hirosawa.M., Nagase.T., Ishikawa.K., Kikuno.R., Nomura.N. and 
Ohara,0. 

TITLE Characterization of cDNA clones selected by the GeneMark analysis 

from size-fractionated cDNA libraries from human brain 
JOURNAL DNA Res. 6 (5), 329-336 (1999) 
MEDLINE 20039618 
PUBMED 10574461 
REFERENCE 2 (bases 1 to 5814) 
AUTHORS OharaA, NagaseJ. and Kikuno,R. 
TITLE Direct Submission 

JOURNAL Submitted (04-OCT-1999) Osamu Ohara, Kazusa DNA Research Institute, 
Laboratory of DNA Technology; 1532-3 Yana, Kisarazu, Chiba 
292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, 
URL:http://www.kazusa.or.jp/huge/, Tel:438-52-3913, 
Fax:438-52-3914) 

COMMENT On May 17, 2001 this sequence version replaced gi:6330044. 
FEATURES Location/Qualifiers 
source 1. .5814 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/clone="fg04087" 

/sex="male" 

/tissue_type="brain" 

/clone_lib= M pBluescriptll SK plus" 

/dev_stage-'adult" 

/note="vector:pBluescriptll SK plus. This sequence is 

replaced that of hg01289 cDNA as a representative cDNA 

sequence for KIAA1 149" 
gene 1. .5814 

/gene= M KIAA1149" 
CDS <351. .1949 

/gene="KIAA1149" 


/note-'Start codon is not identified." 
/codon_start=1 
/product="KIAA1 149 protein" 
/protein_id="BAA86463.2" 
/db_xref="GI:14133243" 

/translation="EATSTTQTWGQAPGTDVGQCEPRGPEGRGPPWPKPCPGSCCGWA 

RECCLPTAPSTASGCPCAAAWGGAPLGLRLPRETDEEPEEPGRRGSFVEMVDNLRGKS 

GQGYYVEM7VGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQLSSTYRDLRKGVY 

VPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEI 

ARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLY 

TGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKKVFE 

AAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTNQSFRITI 

LPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKRIGFAVSA 

CHVHDEFRTAAVEGPFVTLDMEDCGYNIPQTDESTLMTIAYVMAAICALFMLPLCLMV 

CQWRCLRCLRQQHDDFADDISLLK" 
BASE COUNT 1494 a 1510 c 1404 g 1406 t 
ORIGIN 

Query Match 98.6%; Score 2040; DB 9; Length 5814; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 2054; Conservative 0; Mismatches 5; Indels 1; Gaps 1; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 443 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 502 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCT- 
GGGGGGCGCCCCCCTGGG 119 

lllllllllllllllllllllllllllllllllllllllll llllllllllllllllll 
Db 503 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGGCGCCCC 
CCTGGG 562 

Qy 120 

GCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGG 
CAGCTT 179 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 563 

GCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGG 
CAGCTT 622 

Qy 180 

TGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAG 
ATGAC 239 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 623 

TGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAG 
ATGAC 682 

Qy 240 

CGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACT 
TTGC 299 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 683 

CGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACT 
TTGC 742 


Qy 300 

AGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCA 

GCAC 359 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 743 

AGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCA 
GCAC 802 

Qy 360 

ATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAG 
GGGA419 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 803 

ATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAG 
GGGA 862 

Qy 420 

GCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCA 
ACAT 479 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 863 

GCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCA 
ACAT 922 

Qy 480 

TGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCAT 
CCT 539 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 923 

TGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCAT 
CCT 982 

Qy 540 

GGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTG 
ACTC 599 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 983 

GGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTG 
ACTC 1042 

Qy 600 

TCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGG 
CTT 659 

llllllllllllllllllllllllllllllllllllllllll lllllllllllllllll 
Db 1043 

TCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGG 
CTT 1102 

Qy 660 

CCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAG 
GTAT719 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1103 

CCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAG 
GTAT 1162 

Qy 720 

CGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGT 
ATTA 779 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1163 

CGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGT 
ATTA 1222 

Qy 780 

TGAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGA 839 

nun iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 1223 

TGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGA 1282 


Qy 840 

GTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAA 
GAA 899 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1283 

GTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAA 
GAA 1342 

Qy 900 

AGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCC 
TGA 959 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1343 

AGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCC 
TGA 1402 

Qy 960 

TGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGA 
ACAT1019 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1403 

TGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGA 
ACAT 1462 

Qy 1020 

TTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATC 
AC 1079 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1463 

TTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATC 
AC 1522 

Qy 1080 

CATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACG 
ACTG 1139 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1523 

CATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACG 
ACTG 1582 

Qy 1140 

TTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GA 1199 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1583 

TTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GA 1642 

Qy 1200 

GGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGC 
TTG 1259 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1643 

GGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGC 
TTG 1702 

Qy 1260 

CCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGG 
ACAT 1319 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1703 

CCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGG 
ACAT 1762 

Qy 1320 

GGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGC 
CTA 1379 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1763 

GGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGC 
CTA 1822 


Qy 1380 

TGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCA 
GTG 1439 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1823 

TGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCA 
GTG 1882 

Qy 1440 

GCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCT 
GCT 1499 

llllllllllllllllllllllllllllllllllllllllllllllilllllllllllll 
Db 1883 

GCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCT 
GCT 1942 

Qy 1500 

GAAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTG 
GTTC 1559 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1943 

GAAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTG 
GTTC 2002 

Qy 1560 

ACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC 
CCTC 1619 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2003 

ACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC 
CCTC 2062 

Qy 1620 

CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 
TCCA 1679 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 2063 

CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 
TCCA2122 

Qy 1680 

GGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGA 
ATAC 1739 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2123 

GGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGA 
ATAC 2182 

Qy 1740 

TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAA 
CC 1799 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2183 

TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAA 
CC2242 

Qy 1800 

TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 
1859 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2243 

TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 
2302 

Qy 1860 

AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAA 1919 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2303 

AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAA 2362 


Qy 1920 

GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGC 
TAT 1979 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2363 

GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGC 
TAT 2422 

Qy 1980 

TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCT 
TG 2039 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2423 

TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCT 
TG 2482 

Qy 2040 AATTAAAAAAAAAAAAAAAA 2059 
lllllllllllllll I I 

Db 2483 AATTAAAAAAAAAAACTAGA 2502 


RESULT 12 
AR1 78469 

LOCUS AR178469 2541 bp DNA linear PAT 20-APR-2002 

DEFINITION Sequence 1 from patent US 6319689. 

ACCESSION AR1 78469 

VERSION AR1 78469.1 Gl:20219607 

KEYWORDS . 

SOURCE Unknown. 

ORGANISM Unknown. 
Unclassified. 
REFERENCE 1 (bases 1 to 2541) 

AUTHORS Powell.D.J., Chapman.C.G., Murphy.K- and Smith.T.S. 

TITLE ASP2 

JOURNAL Patent: US 6319689-A 1 20-NOV-2001; 
FEATURES Location/Qualifiers 
source 1. .2541 


/organism="un known" 
BASE COUNT 598 a 673 c 675 g 579 1 16 others 
ORIGIN 

Query Match 98.2%; Score 2032; DB 6; Length 2541; 

Best Local Similarity 99.5%; Pred. No. 0; 

Matches 2049; Conservative 0; Mismatches 10; Indels 1; Gaps 1; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

lllllllllllllllllllllllllllllllllillllllllllllllllllllllllll 


Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 

TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 

GGAG 420 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGAGCCCTACACCCAGGGCAAGTGGGAAG 
GGGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 

CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 


Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 

CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC 660 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 

GGAG 840 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 781 

GAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 

CATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 


Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllilllllllll 
Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 

CTGT 1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1 140 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 

TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 

TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 

CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1 50 1 AAGTGAGG AGGCCCATGGGCAGAAG ATAGAGATTCCCCT- 
GGACCACACCTCCGTGGTTC 1559 

lllllllllllllllllll lllllllllllllllll llllllllllllllllllll 
Db 1501 

AAGTGAGGAGGCCCATGGGAGAAAGATAGAGATTCCCCTGGGACCACACCTCCGTG 
GTTC 1560 


Qy 1560 

ACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC 

CCTC 1619 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1561 

ACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC 
CCTC 1620 

Qy 1620 

CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 

TCCA 1679 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1621 

CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 
TCCA 1680 

Qy 1680 

GGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGA 
ATAC 1739 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1681 

GGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGA 
ATAC 1740 

Qy 1740 

TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAA 
CC 1799 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1741 

TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAA 
CC 1800 

Qy 1800 

TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 
1859 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1801 

TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 
1860 

Qy 1860 

AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 

GAA 1919 

llllllllllllllllllllllllllllllllllllllllllllllllll lllllllll 
Db 1861 

AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCGGGCAG 
AGAA 1920 

Qy 1920 

GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGC 
TAT 1979 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1921 

GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGC 
TAT 1980 

Qy 1980 

TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCT 
TG 2039 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1981 

TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCT 
TG 2040 

Qy 2040 AATTAAAAAAAAAAAAAAAA 2059 
lllllllllllllll II 

Db 2041 AATTAAAAAAAAAAACTAGA 2060 


RESULT 13 
AX002655 

LOCUS AX002655 2541 bp DNA linear PAT 1 0-MAR-2000 

DEFINITION Sequence 1 from Patent EP0855444. 


ACCESSION AX002655 
VERSION AX002655.1 Gl:7242133 
KEYWORDS . 
SOURCE unidentified 
ORGANISM unidentified 
unclassified. 
REFERENCE 1 (bases 1 to 2541) 
AUTHORS Murphy.K. and Chapman.C.G. 
TITLE Aspartic proteinase 2 (ASP2) 
JOURNAL Patent: EP 0855444-A 1 29-JUL-1998; 

SMITHKLINE BEECHAM PLC (GB); SMITHKLINE BEECHAM CORP (US) 
FEATURES Location/Qualifiers 
source 1. .2541 

/organism-'unidentified" 
/mol_type="genomic DNA" 
/db_xref="taxon:32644" 
BASE COUNT 598 a 673 c 675 g 579 1 16 others 
ORIGIN 

Query Match 98.2%; Score 2032; DB 6; Length 2541; 

Best Local Similarity 99.5%; Pred. No. 0; 

Matches 2049; Conservative 0; Mismatches 10; Indels 1; Gaps 1; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 

AGCTTT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 

TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 

CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 


Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 

GGAG 420 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGAGCCCTACACCCAGGGCAAGTGGGAAG 
GGGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 

CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 

CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC660 

lllllllllllllllllllllllllllllllllllllllll llllllllllllllllll 


Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC 660 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 

TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 

GGAG 840 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 781 

GAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 


Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 

CATT1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT1020 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllilllllllllllllllllll 
Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGG CCAGTG GAAGATGTGG CCACGTCCCAAG ACG A 
CTGT1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 

TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 


Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 

CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCT- 
GGACCACACCTCCGTGGTTC 1559 

lllllllllllllllllll lllllllllllllllll llllllllllllllllllll 
Db 1501 

AAGTGAGGAGGCCCATGGGAGAAAGATAGAGATTCCCCTGGGACCACACCTCCGTG 
GTTC 1560 

Qy 1560 

ACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC 
CCTC 1619 

llllllllllllllllllllllllllllllllllllllllllllllllllllllilllll 
Db 1561 

ACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC 
CCTC 1620 

Qy 1620 

CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 
TCCA1679 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1621 

CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 
TCCA 1680 

Qy 1680 

GGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGA 
ATAC 1739 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1681 

GGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGA 
ATAC 1740 

Qy 1740 

TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAA 
CC 1799 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1741 

TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAA 
CC 1800 

Qy 1800 

TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 
1859 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1801 

TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 
1860 

Qy 1860 

AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAA 1919 

llllllllllllllllllllllllllllllllllllllllllllllllll lllllllll 
Db 1861 

AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCGGGCAG 
AGAA 1920 

Qy 1920 

GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGC 
TAT 1979 

lllllllllllllllllllllllllllllllllllllllllllillllllllllllllll 
Db 1921 

GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGC 
TAT 1980 


Qy 1980 

TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCT 
TG 2039 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1981 

TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCT 
TG 2040 

Qy 2040 AATTAAAAAAAAAAAAAAAA 2059 
lllllllllllllll II 

Db 2041 AATTAAAAAAAAAAACTAGA 2060 


RESULT 14 
AX700452 

LOCUS AX700452 2541 bp DNA linear PAT 03-APR-2003 

DEFINITION Sequence 7 from Patent WO03012089. 

ACCESSION AX700452 

VERSION AX700452.1 Gl:29536241 

KEYWORDS . 

SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Yon.J., CleasbyA, Bruinzeel.W.D., Masure.S.L, Tickle.l. and 
SharffA 

TITLE Crystal structure of beta-site app cleaving enzyme (bace) and use 
thereof 

JOURNAL Patent: WO 03012089-A 7 13-FEB-2003; 

Astex Technology Limited (GB) ; JANSSEN PHARMACEUTICA N.V. (BE) 
FEATURES Location/Qualifiers 
source 1. .2541 

/organism-'Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref="taxon:9606" 
BASE COUNT 598 a 673 c 675 g 579 1 16 others 


ORIGIN 


Query Match 98.2%; Score 2032; DB6; Length 2541; 

Best Local Similarity 99.5%; Pred. No. 0; 

Matches 2049; Conservative 0; Mismatches 10; Indels 1; Gaps 1; 
Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

lllllllllllllllllllllllllllllllllllllllllllllllllllillllllll 
Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 


Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 

TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 
GGAG 420 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGAGCCCTACACCCAGGGCAAGTGGGAAG 
GGGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 

CATT 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 

Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC 660 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 

TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 

TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 


Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 

GGAG 840 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 781 

GAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 

CATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 

CTGT 1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1140 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 

GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 

TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 

CATG 1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 


Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 

TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 

TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 

CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCT- 
GGACCACACCTCCGTGGTTC 1559 

lllllllllllllllllll lllllllllllllllll llllllllllllllllllll 
Db 1501 

AAGTGAGGAGGCCCATGGGAGAAAGATAGAGATTCCCCTGGGACCACACCTCCGTG 
GTTC 1560 

Qy 1560 

ACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC 
CCTC 1619 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1561 

ACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC 
CCTC 1620 

Qy 1620 

CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 

TCCA 1679 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1621 

CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 
TCCA 1680 

Qy 1680 

GGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGA 

ATAC 1739 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1681 

GGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGA 
ATAC 1740 

Qy 1740 

TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAA 
CC 1799 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1741 

TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAA 
CC 1800 

Qy 1800 

TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 
1859 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1801 

TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 
1860 


Qy 1860 

AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 

GAA1919 

llllllllllllllllllllllllllllllllllllllllllllllllll IIIIIIIH 
Db 1861 

AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCGGGCAG 
AG AA 1920 

Qy 1920 

GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGC 

TAT 1979 

llllllllllllllllllllllilllllllllllllllllllllllllllllllllllll 
Db 1921 

GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGC 
TAT 1980 

Qy 1980 

TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCT 
TG 2039 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1981 

TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCT 
TG 2040 

Qy 2040 AATTAAAAAAAAAAAAAAAA 2059 
lllllllllllllll II 

Db 2041 AATTAAAAAAAAAAACTAGA 2060 


RESULT 15 
E50816 

LOCUS E50816 2541 bp DNA linear PAT 18-JUN-2001 

DEFINITION Aspartate proteinase ASP2. 

ACCESSION E50816 

VERSION E50816.1 GM3023199 

KEYWORDS JP2000060579-A/1. 

SOURCE unidentified 


ORGANISM unidentified 

unclassified. 
REFERENCE 1 (bases 1 to 2541) 
AUTHORS David.J.P., Conrad.G.C., Kay.M. and Trudy.S.S. 
TITLE Aspartate proteinase ASP2 
JOURNAL Patent: JP 2000060579-A 1 29-FEB-2000; 

SMITHKLINE BEECHAM CORP 
COMMENT OS Unidentified 

PN JP 2000060579-A/1 

PD 29-FEB-2000 

PF 03-AUG-1999 JP 1999219665 

PR 28-JAN-1997 GB 9701684:4 

PI DAVID J POWERU.CONRAD G CHAPPUMAN.KAY MAFI.TRUDY S SMITH 

PC 

C12N15/09.A61 K31/7088,A61 K38/46.A61 K39/00.A61 K39/395, PC 
A61 K39/395.A61 K48/00, 

PC A61P25/28,A61P35/00,A61P43/00,C07K16/40 ) C12N1/15,C12N1/19, PC 
C12N1/21, 

PC C12N5/10,C12N9/64,C12Q1/37,G01N33/15,G01N33/50,G01N33/53, PC 
G01N33/566, 

PC G01N33/577//C12P21/08,C12N15/00,A61K37/54,C12N5/00 CC 
Strandedness: Single; 
CC Topology: Linear; 
FH Key Location/Qualifiers 
FT source 1. .2541 
FT /organism='Unidentified'. 
FEATURES Location/Qualifiers 
source 1. .2541 

/organism-'unidentified" 
/mol_type="genomic DNA" 
/db_xref="taxon:32644" 
BASE COUNT 598 a 673 c 675 g 579 1 16 others 
ORIGIN 

Query Match 98.2%; Score 2032; DB6; Length 2541; 

Best Local Similarity 99.5%; Pred. No. 0; 

Matches 2049; Conservative 0; Mismatches 10; Indels 1; Gaps 1; 


Qy 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTG 
CCCAC 60 

Qy 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 

CTGGGG 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGG 120 

Qy 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 

AGCTTT 1 80 

llllllllllllllilllllllllllllllllllllllllllllllllllllllllllll 
Db 121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTT 180 

Qy 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 

TGACC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGA 
TGACC 240 

Qy 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTT 
TGCA 300 

Qy 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 

CACA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAG 
CACA 360 

Qy 361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGG 

GGAG 420 

llllllllllllllllllllilllllll lllllllllllllllllllllllllllllll 
Db 361 

TACCGGGACCTCCGGAAGGGTGTGTATGAGCCCTACACCCAGGGCAAGTGGGAAG 
GGGAG 420 

Qy 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 

CATT 480 

llilllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAA 
CATT 480 

Qy 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 

CTG 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 
CTG 540 


Qy 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 
TCT 600 

Qy 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGC 
TTC 660 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 
TTC 660 

Qy 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGG 
TATC 720 

Qy 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 

TTAT 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA 
TTAT 780 

Qy 781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Hill llllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 781 

GAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAA 
GGAG 840 

Qy 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 841 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 
AAA 900 

Qy 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GAT 960 

Qy 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 

CATT 1020 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAA 
CATT 1020 

Qy 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCA 
CC 1080 


Qy 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 

CTGT1140 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1081 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGA 
CTGT 1 140 

Qy 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 

GAG 1200 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAG 1200 

Qy 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 

TGC 1260 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 
TGC 1260 

Qy 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 

CATG1320 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGA 
CATG 1320 

Qy 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


Db 1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCC 
TAT 1380 

Qy 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAG 
TGG 1440 

Qy 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTG 
CTG 1500 

Qy 1 50 1 AAGTG AGG AGGCCCATGGGCAG AAG ATAG AG ATTCCCCT- 
GGACCACACCTCCGTGGTTC 1559 

lllllllllllllllllll lllllllllllllllllllllllllllllllllllll 
Db 1501 

AAGTGAGGAGGCCCATGGGAGAAAGATAGAGATTCCCCTGGGACCACACCTCCGTG 
GTTC 1560 

Qy 1560 

ACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC 
CCTC1619 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1561 

ACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC 
CCTC 1620 


Qy 1620 

CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 

TCCA 1679 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1621 

CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGT 
TCCA 1680 

Qy 1680 

GGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGA 

ATAC 1739 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1681 

GGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGA 
ATAC 1740 

Qy 1740 

TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAA 
CC 1799 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1741 

TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAA 
CC 1800 

Qy 1800 

TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 
1859 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1801 

TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 
1860 

Qy 1860 

AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 
GAA1919 

llllllllllllllllllllllllllllllllllllllllllllllllll lllllllll 


Db 1861 

AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCGGGCAG 
AGAA 1920 

Qy 1920 

GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGC 
TAT 1979 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1921 

GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGC 
TAT 1980 

Qy 1980 

TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCT 
TG 2039 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1981 

TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCT 
TG 2040 

Qy 2040 AATTAAAAAAAAAAAAAAAA 2059 
lllllllllllllll II 

Db 2041 AATTAAAAAAAAAAACTAGA 2060 
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24 
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24 
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1843.4 
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1 
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19 
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27 
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4 
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22 
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22 
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DNA encoding mouse 
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AAD06741 
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ABL52459 

Mouse Asp-2 (a) nuc 
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23 
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24 
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9 
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22 

AAD09485 

Human aspartyl pro 

42 
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7 
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23 

AAS73798 
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43 
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4 
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24 
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44 
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7 
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21 

AAA15688 

Modified human asp 
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7 
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22 
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ALIGNMENTS 


RESULT 1 
AAA15662 

ID AAA15662 standard; cDNA; 2070 BP. 
XX 

AC AAA15662; 
XX 

DT 03-AUG-2000 (first entry) 
XX 

DE Human aspartyl protease 2 (a) (Asp2) nucleotide sequence. 
XX 

KW Aspartyl protease; aspartase; amyloid precursor protein; APP; Asp 2; 

KW Alzheimer's disease; beta secretase site; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200017369-A2 . 
XX 

PD 30-MAR-2000. 
XX 

PF 23-SEP-1999; 99WO-US20881 . 
XX 

PR 24-SEP-1998; 98US-0 101594 . 
XX 

PA ( PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Gurney ME, Bienkowski MJ, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2000-303209/26. 

DR P-PSDB; AAY88425. 
XX 

PT New enzyme designated human aspartase useful in research into 

PT Alzheimer's Disease is capable of cleaving amyloid protein precursor at 

PT the beta secretase site to produce amyloid beta peptide - 

XX 

PS Claim 5; Fig 2; 183pp; English. 
XX 

CC This sequence represents the human aspartyl protease 2 (Asp2) nucleotide 

CC sequence. The invention relates to a protease (e.g. Asp2) capable of 

CC cleaving the beta secretase site of amyloid precursor protein (APP) . The 

CC protease contains a sequence encoding the amino acid sequence DTG and a 

CC sequence encoding DSG or DTG separated by 100-300 amino acids. When 

CC mutated the APP gene causes an autosomal dominant form of Alzheimer's 

CC disease. APP localises to the cell surface membrane and have a single 

CC C-terminal transmembrane domain. Proteolytic processing of APP produces 

CC the amyloid beta protein, which is possibly very important in Alzheimer's 

CC disease. The invention includes a nucleotide sequence encoding the 

CC protease, a vector containing the nucleotide sequence, and a cell line 

CC comprising the vector. Methods for screening for inhibitors of beta 

CC secretase activity are also given in the invention. The human aspartase 

CC protein and nucleotide sequences and the methods for identifying 

CC inhibitors of the protease, are useful in the treatment of and research 

CC in to Alzheimer's disease. 

XX 

SQ Sequence 2070 BP; 476 A; 583 C; 562 G; 449 T; 0 other; 

Query Match 100.0%; Score 2070; DB 21; Length 2070; 


Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2 070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 


Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Qy 181 GT GGAGATGGT GGACAACCTGAGGGGCAAGT CGGGGCAGGGCTACT AC GTGGAGAT GACC 240 

1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGT CGGGGCAGGGCTACTAC GTGGAGAT GACC 240 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I 
Db 241 GTGGGCAGCCCCCCGCAGACGCTC7\ACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

Qy 481 GCT GC CAT CACT GAAT C AGACAAGT T CT T CAT CAAC GG CT C CAACT GG GAAGGC AT C CT G 540 

I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I II I I I I I 
Db 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I II I I 
Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 CCC CT CAACCAGT CT GAAGT GCT GGCCT CT GT CGGAGGGAGCAT GAT CATT GGAGGTAT C 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II M I I I I I I I I I 

Db 661 CCC CT CAAC CAGT CT GAAGT GCTGGCCT CT GT C GGAGGGAGC AT GAT CAT T GGAGGTAT C 720 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I 

Db 721 GAC CACT C GC T GT ACAC AG GC AGT CT CT GGTAT ACACC CAT C CGGC GGGAGT G GT AT TAT 780 

Qy 7 81 GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGAC T GCAAGGAG 84 0 

I I I I I I I I I I I I I II I I I I I I I I I II | | | | | | | | | | | | | | | | I I I I I I I I I I I I | | | | | | 


Db 


781 


GAGGT CAT CATT GT GC GGGT GGAGAT CAAT G GACAG GAT CTGAAAAT GG ACT GCAAGGAG 84 0 


Qy 841 TACAACTAT GACAAGAGCATT GT GGACAGT GGCACCACCAACCTT CGTTT GC CCAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCT^CCTTCGTTTGCCCAAGAAA 900 

Qy 901 GT GT T TGAAGCT GCAGT CAAAT C CAT CAAGGCAGC C T C CT CC AC GGAGAAGT T C CCT GAT 960 

I II I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I II I I I I I I I I I I I I I I I I 
Db 901 GT GT T T GAAGCT GCAGT CAAAT C CAT CAAG GC AGC CT C CT CC AC GGAGAAGT T C CCT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 T T C C CAGT CAT CT CACT CT AC CT AAT GGGT GAGGTT AC CAAC C AGT C CT T C C GCAT CAC C 108 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 TTCCCAGTCATCT CACT CT AC CTAATGGGTGAGGTTAC CAAC CAGT CCTTCCGCAT CAC C 108 0 

Qy 1081 AT C C TT C C GC AGCAAT AC CT GC G GC CAGT GGAAGAT GT GGCCAC GT C C CAAGAC GACT GT 1140 

I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 1081 AT C CTT C C GCAGCAAT AC CT GC GGC CAGT GGAAGAT GT G GC C AC GT C CCAAGAC GACT GT 114 0 

Qy 1141 TACAAGT TT GC CAT CT CAC AGT CAT CC AC G GG C ACT GT TAT GGGAGCTGT T AT CAT GGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 1141 TACAAGTTT GC CATCTCACAGT CAT CCACGGGCACT GTT AT GGGAGCTGTT AT CAT GGAG 1200 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

Qy 1261 CAT GTGC AC GAT GAGT T C AGGAC GGCAGC GGT GGAAGG C CCT T T T GT CAC CT T GGACAT G 132 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1261 CAT GTGCAC GAT GAGT T CAGGAC GGC AG C GGT GGAAGGC CCT TT T GT C AC CT T GGACAT G 132 0 

Qy 1321 GAAGACT GT GGCT ACAACAT T C C ACAGACAGAT GAGT CAAC CCT CAT GAC CAT AGC CT AT 138 0 

I I I I I I I I I I I I I I I I I II I I I I I II I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 1321 GAAGACT GT GGCTACAACATT CCACAGACAGATGAGT CAAC CCT CAT GAC C AT AGC CT AT 1380 

Qy 1381 GT CAT GGCT GCCAT CT GCGC CCT CTT CAT GCT GCCACT CTGCCT CAT GGT GT GTCAGT GG 144 0 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 4 0 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGT GAGGAGGC C CAT GGGC AGAAGAT AGAGAT T C C C CT GGAC C ACAC CT C C GT GGT T CA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 1501 AAGT GAGGAGGC C CAT G GGC AGAAGAT AGAGATT C C C CT GGAC C ACAC CT C C GT GGT T CA 1560 

Qy 1561 CT TT GGT CACAAGT AGGAGAC AC AGAT GGCAC C T GT GGC C AGAGC ACCT C AG GAC C CT C C 1620 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 1561 C T T T GGT CACAAGT AGGAGAC AC AGAT GGCAC CT GT GGC C AGAGC ACCT CAGGAC CCT C C 1620 

Qy 1621 C CAC CCAC CAAAT GCCTCTGC CTT GAT GGAGAAGGAAAAGGCTGGCAAGGTG GGT TCCAG 168 0 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 C CAC CCAC CAAAT GCCTCTGC CTT GAT GGAGAAGGAAAAGGCTGGCAAGGTGGGTTCC AG 168 0 


Qy 

Db 


1681 
1681 


1740 
1740 


Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

Qy 18 01 TTGTCCACCATTCCTTTAAATTCTCCAACCCTWVGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 18 01 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGAC CAAG CTT GTT T C C CT GCT GG CCAAAGT C AGT AG GAGAGGAT GCAC AGTT T GC T AT T 1980 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II 

Db 1921 AGACCAAGCTT GTTTCC CT GCT GGCCAAAGT CAGTAGGAGAGGAT GCACAGTTT GCT ATT 1980 

Qy 1981 T GCTTT AGAGACAGGGACT GTATAAACAAGCCTAACATT GGT GCAAAGATT GCCTCTT GA 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 1981 T GCTTT AGAGACAGGGACT GTATAAACAAGCCTAACATT GGT GCAAAGATT GCCTCTT GA 2 04 0 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I II I I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 2 
AAD17865 

ID AAD17865 standard; cDNA; 2070 BP. 
XX 

AC AAD17865; 
XX 

DT 10-DEC-2001 (first entry) 
XX 

DE Human aspartyl protease 2(a) [hu-Asp2(a)] cDNA. 
XX 

KW Human; aspartyl protease 2(a); Asp2(a); amyloid precursor protein; APP; 

KW Alzheimer's disease; AD; dementia; neurofibrillary tangle; gliosis; 

KW amyloid plaque; neuronal loss; proteolytic; nootropic; neuroprotective; 

KW chromosome llq23 . 3-24 . 1 ; ss . 
XX 

OS Homo sapiens. 


XX 

FH Key Location/Qualifiers 

FT CDS 1. .1506 

FT /*tag= a 

FT /product- "Human aspartyl protease 2(a)" 

FT sig_peptide 1..63 

FT /*tag= b 

FT matpeptide 64 . . 1503 

FT /*tag= c 

FT /product= "Mature human aspartyl protease 2(a)" 

XX 


PN GB2357767-A. 
XX 

PD 04-JUL-2001. 
XX 

PF 22-SEP-2000; 2000GB-0023315 . 
XX 

PR 23-SEP-1999; 

PR 23-SEP-1999; 

PR 23-SEP-1999; 

PR 13-OCT-1999; 

PR 06-DEC-1999; 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Bienkowkski MJ, Gurney M; 
XX 

DR WPI; 2001-444208/48. 

DR P-PSDB; AAE10629. 
XX 

PT Polypeptide comprising fragments of human aspartyl protease with 

PT amyloid precursor protein processing activity and alpha-secretase 

PT activity, for identifying modulators useful in treating Alzheimer's 

PT disease - 
XX 

PS Example 2; Fig 2; 187pp; English. 
XX 

CC The patent discloses human aspartyl protease 1 (hu-Aspl) or modified 

CC Aspl proteins which lack transmembrane domain or amino terminal 

CC domain or cytoplasmic domain and retains alpha-secretase activity 

CC and amyloid protein precursor (APP) processing activity. The proteins 

CC of the invention are useful for assaying hu-Aspl alpha-secretase 

CC activity, which in turn is useful for identifying modulators of 

CC hu-Aspl alpha-secretase activity, where modulators that increase 

CC hu-Aspl alpha-secretase activity are useful for treating Alzheimer's 

CC disease (AD) which causes progressive dementia with consequent 

CC formation of amyloid plaques, neurofibrillary tangles, gliosis and 

CC neuronal loss. Hu-Aspl protease substrate is useful for assaying 

CC hu-Aspl proteolytic actiOity, by contacting hu-Aspl protein with 

CC the substrate under acidic conditions and determining the level of 

CC hu-Aspl proteolytic activity. The present sequence is a cDNA encoding 

CC long form of human Asp2 protein, designated as Asp2 (a) . Asp2 gene is 

CC localised on chromosome llq23 . 3-24 . 1 . 

XX 

SQ Sequence 2070 BP; 476 A; 583 C; 562 G; 449 T; 0 other; 

Query Match 100.0%; Score 2070; DB 22; Length 2070; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 


99US-0155493. 
99US-0404133. 
99WO-US20881 . 
99US-0416901. 
99US-0169232. 


Qy 

Db 


61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 


120 


120 


Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 GT GGAGAT GGT GGACAACCTGAGGGGCAAGT CGGGGCAGGGCT ACTACGT GGAGATGACC 24 0 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 41 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I 
Db 3 01 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 3 61 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 48 0 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 48 0 

Qy 4 81 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 4 81 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I II 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 C C CCTCAACCAGTCT GAAGT GCT GGCCT CT GT CGGAGGGAGCATGATCATT GGAGGTAT C 72 0 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I 
Db 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

Qy 721 GAC C ACT C GC T GT AC AC AGGCAGT C TCT GGT AT ACAC C CAT C C GGC GGGAGT GGT AT TAT 78 0 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 721 GAC CACT C GCT GT AC AC AGGCAGT C TCT GGT AT ACAC C CAT C C GGC GGGAGT GGT AT TAT 78 0 

Qy 781 GAGGT CAT C ATT GT GC G GGT GGAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db - 781 GAGGT CAT CATT GT GC GGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

Qy 841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACC7\ACCTTCGTTTGCCCAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 900 

Qy 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I 
Db 901 GT GTT T GAAGCT G CAGT CAAAT C C ATCAAGGC AGC C T C CT C CAC GGAGAAGT T CC CT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 10 8 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 108 0 

Qy 1081 AT C CTT C C G CAGCAAT ACCT GC GGC C AGT GGAAGAT GT GGC C AC GT C C CAAGAC GACT GT 1140 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 1081 AT C C T T C C GCAGCAAT ACCT GC GGC C AGT GGAAGAT GT GGC C AC GT C C CAAGAC GACT GT 114 0 

Qy 1141 TACAAGT TT GC CAT CT CACAGT CAT C CAC GGGCACT GTT AT GGGAGCT GT TAT CAT G GAG 12 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1141 TACAAGT T T GC CAT CT CACAGT CAT C CAC GG GCACT GT T AT G GGAGCT GT TAT CAT G GAG 12 00 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 1261 CAT GT GCAC GAT GAGT T C AGGAC GGC AGC G GT GGAAGGC C CTTT T GT CAC CT T GGAC AT G 1320 

I I I I I II I I I I II I I I I I If I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I 

Db 1261 CAT GT GCAC GAT GAGT T C AGGACGG CAGC GGT GGAAG GC C CTTT T GT CAC CT T GGAC AT G 1320 

Qy 1321 GAAGACT GT GG CT ACAAC AT T C C AC AGACAGAT GAGT CAAC CCT CAT GAC CAT AG C CT AT 1380 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 1321 GAAGAC T GT GGCT ACAAC AT T C CAC AGACAGAT GAGT CAAC CCT CAT GAC CAT AGC C TAT 138 0 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 4 0 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 4 0 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I II I I I I I I I I I I I I I I I I M 

Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGT GAGGAG GCC CAT GGGC AGAAGAT AGAGAT T C C C CT GGAC C ACAC CTCCGTGGTT CA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1501 AAGT GAGGAGGCC C AT GGGC AGAAGAT AGAGAT T C C C CT GGAC CACAC CT CC GT GGT T CA 15 60 

Qy 1561 CTTT GGT C ACAAGT AG GAGACAC AGAT GGCAC CT GT GGC CAGAGC AC CT CAGGAC C CT C C 1620 

I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I II I II II I I I I II I I I I I I I I I I 

Db 1561 CT T T GGT C ACAAGT AG GAGACAC AGAT GGCAC C TGT GGC CAGAG C ACCT CAGGAC C CTC C 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I II I I I II 

Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 

Qy 1681 GGACTGTACCT GTAGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCTGGCGGGAATACT 174 0 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I 

Db 1681 GGAC T GT AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAG CACT CT G CT GGC GGGAAT ACT 174 0 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I M II I I I I I I I I I I I I I I 

Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

Qy 18 01 TTGTCCACCATTCCTTTAAATTCTCC7\ACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 


Db 18 01 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 1921 AGAC CAAGC T TGTTTCCCT GCT GGC CAAAGT C AGTAGGAGAGGATGCAC AGT T T GC T AT T 1980 

Qy 1981 T GCT T T AGAGAC AGGGACT GT ATAAACAAGC CTAAC AT T GGT GCAAAGAT TGC CT CTT GA 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I II I I I I I I I I I I I 

Db 1981 T G C TT T AGAGACAGGGACT GT ATAAACAAGCCT AAC AT T GGT GCAAAGATT GC C T CTT GA 2 04 0 

Qy 2 041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 207 0 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 3 
AAD13021 

ID AAD13021 standard; cDNA; 2070 BP. 
XX 

AC AAD13021; 
XX 

DT 23-OCT-2001 (first entry) 
XX 

DE Human aspartyl protease 2a (Hu-Asp2a) cDNA. 
XX 

KW Human; aspartyl protease 2a; Asp 2a; beta-amyloid precursor protein; APP; 

KW beta-secretase; Alzheimer's disease; dementia; amyloid plaque; gliosis; 

KW neurofibrillary tangle; neuronal loss; amyloid-beta peptide; nootropic; 

KW neuroprotective; antisense therapy; gene therapy; 

KW chromosome llq23 . 3-24 . 1 ; ss. 
XX 

OS Homo sapiens. 


XX 

FH Key Location/Qualif iers 

FT CDS 1..1506 
FT /*tag= a 

FT /product^ "Human aspartyl protease 2a (Hu-Asp2a) " 

FT sig_peptide 1..63 

FT /*tag= b 

FT mat_peptide 64.. 1503 

FT /*tag= c 

FT /product= "Mature human aspartyl protease 2a (Hu-Asp2a) " 

XX 


PN WO200150829-A2 . 
XX 

PD 19-JUL-2001. 
XX 

PF 09-MAY-2001; 2001WO-IB00799 . 
XX 

PR 09-MAY-2001; 2001WO-IB00799 . 
XX 

PA (BIEN/) BIENKOWSKI M J. 

PA (GURN/) GURNEY M E. 


PA (HEIN/) HEINRIKSON R L. 

PA (PARO/) PARODI L A. 

PA (YANR/) YAN R. 
XX 

PI Bienkowski MJ, Gurney ME, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2001-483072/52. 

DR P-PSDB; AAE06859. 
XX 

PT Novel purified polypeptide comprising fragment of mammalian aspartyl 

PT protease 2, lacking Asp2 transmembrane domain and retaining beta 

PT secretase activity of Asp2 useful for identifying inhibitors of Asp2 

PT activity - 

XX 

PS Claim 98; Fig 2; 185pp; English. 
XX 

CC The invention relates to human aspartyl proteases (Hu-Asp) , beta-amyloid 

CC precursor protein (APP) isoforms and their corresponding DNA molecules. 

CC Human aspartyl proteases can act as beta-secretase proteases useful for 

CC treating Alzheimer's disease. APP isoforms are useful for identifying 

CC modulators of amyloid-beta peptide production, for use in designing 

CC therapeutics for the treatment and prevention of Alzheimer's disease, 

CC dementia, formation of amyloid plaques, neurofibrillary tangles, gliosis 

CC and neuronal loss. APP isoforms are also used in methods for identifying 

CC inhibitors and modulators of human Asp2 activity. The invention relates 

CC to a method for identifying agents that modulate the activity of human 

CC aspartyl protease Asp2 . Amyloid-beta peptides obtained from APP are used 

CC as a means to screen in cellular assays for the inhibitors of beta- and 

CC gamma- secretase. Hu-Asp DNA fragments are useful as probes or primers in 

CC polymerase chain reactions (PCR) . The probes are useful for detecting 

CC Hu-Asp nucleic acids in in vitro assays and in Northern and Southern 

CC blots. The present cDNA sequence encodes human aspartyl protease 2 

CC (Hu-Asp2) , a ^long' form designated as (Hu-Asp2a). Hu-Asp 2 gene is 

CC localised on chromosome llq23 . 3-24 . 1 . 
XX 

SQ Sequence 2070 BP; 476 A; 583 C; 562 G; 449 T; 0 other; 

Query Match 100.0%; Score 2070; DB 22; Length 2070; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I II I I I I II I I I I II I I I I I I I I I I I I II I II I I I I I I I I I I I I I II I II I II I I I I I I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Qy 181 GT GGAGAT G GT GGACAAC CT GAGGGG CAAGT C GGGG CAGGGCT ACT AC GT GGAGAT GAC C 240 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 GT GGAGAT GGT GGACAAC CTGAGGGGCAAGTC GGGG CAGGGCT ACT AC GT GGAGAT GAC C 240 


Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTiWCTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

Qy 4 81 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 4 81 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCC7VACTGGGAAGGCATCCTG 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I i I I I I I I I I II I I I I I I I I II I I I I I I I I I 

Db 661 CC C CT CAAC CAGT CTGAAGTGCTGGCCTCTGTCGGAGGGAG CAT GAT CAT TGGAGGT AT C 720 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

Qy 781 GAGGT CAT C ATT GT G C GGGT GGAGAT CAAT G GAC AGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I 

Db 781 GAGGT CAT C ATT GTGCGGGTG GAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

Qy 841 T ACAACT AT GACAAGAGC ATT GT GGAC AGT GGC AC CAC CAAC CTTCGTTTGCC CAAGAAA 900 

I I I I I II I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 841 T ACAACT AT GACAAGAGC ATT GT GGAC AGT GGC AC CAC CAAC CT T C GTTT GC C CAAGAAA 900 

Qy 901 GT GT TT GAAG CT GC AGT CAAAT C CAT CAAGGC AGCCT C CT C CAC GGAGAAGT T CC CT GAT 960 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 T T C C CAGT CAT CT C ACT CT AC CT AAT GGGT GAGGTT AC CAAC CAGT C CT T C C G CAT CAC C 1080 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M M I I I I I I I II I I I I II I I 

Db 1021 TTCC CAGT CAT CTCACTCTACCTAAT GGGT GAGGTTAC CAAC CAGT CCTTCCGCAT CAC C 1080 


Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 

Qy 1141 T ACAAGT T T GC C AT CT C AC AGT CAT C C ACGG G CACT GTT AT GGGAGCT GT TAT CAT GGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Nl I I I I I I I I M I I I I I I I I I I I I 
Db 1141 TACAAGTTT GCCAT CTCACAGT CAT CCACGGGCACT GTTATGGGAGCT GTTATCAT GGAG 1200 

Qy 12 01 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 01 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 12 61 CAT GT G C AC GAT GAGTT C AGGAC GG C AGC GGT GGAAGGC C CT T T T GT C AC CTT GGAC AT G 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1261 CAT GT GCACGAT GAGTT C AG GAC GGC AGC GGT GGAAG GCC CTT T T GT CAC CTT GGAC AT G 1320 

Qy 1321 GAAGACT GT GGCT ACAACAT T C CAC AGACAGAT GAGT CAAC C CT CAT GAC C AT AGC CTAT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 GAAGACT GT GGCT ACAACAT T C CAC AGACAGAT GAGT CAAC C CT CAT GAC CAT AGC CTAT 1380 

Qy 13 81 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

Qy 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGT GAGGAGGC C CAT GGGC AGAAGAT AGAGATT C C C CTGGAC CACAC CT CC GT GGT T CA 1560 

I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I i I II I I I I I I I I I I I I II I I I I I 
Db 1501 AAGT GAGGAGGC C CAT GGGC AGAAGAT AGAGAT T C C C CTGGAC CACAC CT CC GT GGTT CA 1560 

Qy 1561 CT TT GGT C ACAAGT AGGAGACACAGAT GGC AC CT GT GG CCAGAGCAC CT CAGGAC C CT C C 1620 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 15 61 CTTT GGT C ACAAGT AG GAGAC ACAGAT GGC AC CT GT GG CCAGAG CAC CT CAGGAC C CT C C 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

Qy 1681 GGACT GTACCT GTAGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGCGGGAATACT 174 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 1681 GGACT GTACCT GTAGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGCGGGAATACT 174 0 

Qy 17 41 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I I I I II I I I I I I I E I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 17 41 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

Qy 18 01 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 18 01 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

Qy 18 61 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 192 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 18 61 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 192 0 

Qy 1921 AGAC CAAGCT TGTTTCCCTGCTGGC CAAAGT C AGT AG GAGAGGAT GCACAGT T T GCT AT T 198 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 


Db 1921 AGAC CAAGCT T GTT T C C CT GCT GGC CAAAGT C AGT AGGAGAGGAT G CAC AGT T T G CTAT T 1980 

Qy 1981 T G CTT T AGAGACAGG GACT GT ATAAACAAGC CT AACAT T G GT GCAAAGAT TGCCTCTT GA 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 1981 TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTTGA 204 0 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I I I I I I I I I I I I I I I II I I I I I I I I II I I I 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


AAS11517; 

24-OCT-2001 (first entry) 

Human cDNA encoding Aspartyl protease 2 (a) , Asp2 (a) . 

Human; Aspartyl protease; Asp2 (a) ; beta-secretase ; nootropic; 
neuroprotective; amyloid protein precursor; APP; Alzheimer ! s disease; 
amyloid-beta; Abeta; ss. 


RESULT 4 
AAS11517 

ID AAS11517 standard; cDNA; 2070 BP. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
PA 
PA 
PA 
PA 
XX 


Homo sapiens , 

Key 
CDS 


sig_peptide 
sig_peptide 

sig_peptide 

mat peptide 


Location/Qualifiers 
1..1506 
/*tag= a 

/product^ "Asp2(a) M 
1. .63 
/*tag= b 
64. .135 
/*tag= c 

/label= Pre_pro_peptide 
136. . 171 
/*tag= d 

/label= Pro_peptide 
172. .1503 
/*tag= e 

/label= Mature Asp2(a) 


WO200149098-A2. 
12-JUL-2001. 

09-MAY-2001; 2001WO-IB00798 . 

09-MAY-2001; 2001WO-IB00798 . 

(BIEN/) BIENKOWSKI M J. 

(GURN/) GURNEY M E. 

(HEIN/) HEINRIKSON R L . 

(PARO/) PARODI L A. 

(YANR/) YAN R. 


PI Bienkowski MJ, Gurney ME, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2001-502549/55. 

DR P-PSDB; AAU06603. 
XX 

PT Novel purified polypeptide comprising fragment of mammalian aspartyl 

PT protease 2, lacking Asp2 transmembrane domain and retaining beta 

PT secretase activity of Asp2 useful for identifying inhibitors of Asp2 

PT activity - 

XX 

PS Claim 98; Fig 2; 185pp; English. 
XX 

CC The invention relates to a purified polypeptide comprising a fragment of 

CC mammalian aspartyl protease (Asp) 2 protein which lacks the Asp2 

CC transmembrane domain and the Asp2 protein, and where the polypeptide and 

CC the fragment retain the beta-secretase activity of the mammalian Asp2 

CC protein. The invention also details polynucleotides for the Asp 

CC proteins and vectors expressing them, and a polypeptide (isoform of 

CC amyloid protein precursor (APP) ) comprising the amino acid sequence of an 

CC APP or its fragment containing an APP cleavage site recognizable by a 

CC mammalian beta-secretase, and further comprising two lysine residues at 

CC the carboxyl terminus of the amino acid sequence of the mammalian APP or 

CC APP fragment. Also included in the invention are methods of identifying 

CC modulators or inhibitors of Asp2 . Modulators and inhibitors of Asp2 are 

CC useful for treating Alzheimer's disease. APP is useful in methods for 

CC identifying inhibitors or modulators of human Asp2 activity and 

CC amyloid-beta (Abeta) peptide production. APP is also useful in designing 

CC therapeutics for the treatment or prevention of Alzheimer's disease. 

CC APP comprising the APP-Sw-beta-secretase peptide sequence (NLDA) , which 

CC is associated with increased levels of Abeta processing is useful in 

CC assays relating the Alzheimer's research. The expression vector is useful 

CC for recombinantly expressing APP. Nucleic acids that hybridise to 

CC Asp oligonucleotides are useful as probes or primers. The probes are 

CC useful for detecting Hu-Asp nucleic acids in in vitro assays and in 

CC Northern and Southern blots. The present sequence encodes human Asp2(a) . 

XX 

SQ Sequence 2070 BP; 476 A; 583 C; 562 G; 449 T; 0 other; 

Query Match 100.0%; Score 2070; DB 22; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 207 0; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

II I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I I I ! I I I I I I I I I I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

Qy 181 GT GGAGAT GGT GGACAACCTGAGGGGCAAGT CGGGGCAGGGCTACTACGT GGAGAT GACC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGT CGGGGCAGGGCTACTACGT GGAGAT GACC 24 0 


Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 30 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 CT G GG CACC GAC CT GGTAAG CAT C C C CCAT G GC C C CAAC GT CACT GT GC GT GC CAACAT T 480 

Qy 4 81 G CT GC CAT CACT GAAT C AGACAAGT T CT T CAT CAAC GGCT C CAACTGGGAAG GC ATCCT G 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 81 GCT G C CAT CACT GAAT C AGACAAGT T CTT CAT CAAC GGCT C CAACTGGGAAG GC ATCCT G 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 60 0 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 C C C C T CAAC CAGT C T GAAGT GCT GGCCT CT GT C G GAGGGAGC AT GAT CAT T G GAGGTAT C 72 0 

I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I II I M I I I I I II 

Db 661 C C C C T CAAC CAGT C T GAAGT GCT GGCCT CT GT C G GAGGGAGC AT GAT CAT T GGAGGTAT C 72 0 

Qy 721 GACCACTCGCT GTACACAGGCAGT CTCT GGTATACACCCAT CCGGCGGGAGT GGTATTAT 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I I I I I II I I I I I I I I I I I 

Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 78 0 

Qy 7 81 GAGGT CAT CATT GT GCGGGT GGAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 GAG GT CAT CATT GT GCGG GT GGAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

Qy 841 T ACAACT AT GACAAGAGCAT T GT GGAC AGT GGCAC CAC CAAC CT T CGT T T GC C CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 841 T ACAACT AT GACAAGAGCAT T GT GGAC AGT GGCAC CAC C AAC CT T CGTT T GC C CAAGAAA 900 

Qy 901 GT GT T T GAAG CT GCAGT CAAAT C CAT CAAGGC AG C CT C C T C CAC GGAGAAGTT C CCT GAT 960 

I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 901 GT GT T T GAAG CT GCAGT CAAAT C CAT CAAGGC AGC CT C C T C CAC GGAGAAGTT C CCT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 


Qy 

Db 


1021 
1021 


1080 
1080 


Qy 1081 AT C CT T C C G C AGGAATAC CT GC G GCCAGT GGAAGAT GT G GC CAC GT C C CAAGAC GACT GT 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I M I I I I I I 
Db 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 

Qy 1141 T ACAAGT TT GC C AT CT CAC AGT CAT C CAC GG GC ACT GT T AT GGGAGC T GT TAT CAT GGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 T ACAAGT TT GC CAT CT CAC AGT CAT C C AC GGGC ACT GT TAT GGGAGCT GT TAT CAT GGAG 1200 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 12 61 CAT GT G CAC GAT GAGT T C AGGAC GGC AGC G GT GGAAG GC C C TTT T GT CAC CT T GGACAT G 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1261 CAT GT G CAC GAT GAGT T C AGGAC GGC AGC GGT GGAAG GC C CTTTT GT CAC CT T GGACAT G 1320 

Qy 1321 GAAGACTGT GGCT ACAACATT CCACAGACAGAT GAGT CAACCCTCATGACCATAGCCTAT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 GAAGACT GT GGCT ACAAC AT T C CACAGAC AGAT GAGT CAAC CCT C AT GAC CAT AG C CTAT 1380 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 40 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGT GAGGAGGC C CAT GGGC AGAAGATAGAGAT T C CC CT GGAC CAC AC CT C C GT GGT T C A 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I 
Db 1501 AAGT GAGGAGGC C CAT GGGC AGAAGATAGAGAT T C CC CT GGACCACAC CT C C GT G GT T C A 1560 

Qy 1561 CT T T GGT C ACAAGTAGGAGAC AC AGAT GGCAC CT GTGGC C AGAGC AC CT CAGGAC C CT C C 1620 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 1561 CT T T GGT CACAAGTAGGAGAC AC AGAT GGCAC CT GT GGC C AGAGC AC CT CAGGAC C CT C C 1620 

Qy 1621 C CAC C C ACCAAAT GC CTCTGCCTT GAT GGAGAAGGAAAAG GCT GGCAAGGT GGGT T C C AG 1680 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCT GGCAAGGT GGGTTCCAG 1680 

Qy 1681 GGACT GT AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAGCACT CT GCT GGC GGGAAT ACT 1740 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 GGACT GT AC CT GT AG GAAAC AGAAAAGAGAAGAAAGAAGCACT CT GCT G GC GGGAAT ACT 17 40 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 198 0 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1921 AGAC CAAGCT T GTTTCCCTGCT G GC CAAAGT C AGT AGGAGAGGAT GCACAGT T T GCT AT T 19 80 

Qy 1981 T GCT TT AGAGACAGG GACT GT ATAAACAAG C C T AACAT T GGT GCAAAG AT T GC CT CT T GA 2040 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1981 T GCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATT GGT GCAAAGATT GCCTCTT GA 2040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 5 
AAS11702 

ID AAS11702 standard; DNA; 2070 BP. 
XX 

AC AAS11702; 
XX 

DT 24-OCT-2001 (first entry) 
XX 

DE DNA encoding human aspartyl protease 2a (Asp-2a) . 
XX 

KW Human; aspartyl protease 1; Asp-1; nootropic; neuroprotective; 

KW aspartyl protease 2; Asp2; amyloid protein precursor; APP; 

KW beta-secretase; Alzheimer's disease; ds . 
XX 

OS Homo sapiens . 


XX 

FH Key Location/Qualifiers 

FT CDS 1. . 1506 

FT /*tag= a 

FT /product= "Aspartyl protease-2a (Asp-2a) " 

FT sig_peptide 1..63 

FT /*tag= b 

FT misc__f eature 64 . . 135 

FT /*tag= c 

FT /note= "Pre-propeptide" 

FT misc^feature 136.. 171 

FT /*tag= d 

FT /note= "Propeptide" 

FT mat_peptide 172 . . 1503 

FT /*tag= e 

FT /note= "Mature Aspartyl protease-2a" 

XX 


PN WO200149097-A2. 
XX 

PD 12-JUL-2001. 
XX 

PF 09-MAY-2001; 2001WO-IB00797 . 
XX 

PR 09-MAY-2001; 2001WO-IB00797 . 
XX 

PA (BIEN/) BIENKOWSKI M J. 

PA (GURN/) GURNEY M E. 

PA (HEIN/) HEINRIKSON R L. 

PA (PARO/) PARODI L A. 

PA (YANR/) YAN R. 


XX 

PI Bienkowski MJ, Gurney ME, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2001-502548/55. 

DR P-PSDB; AAU07102. 
XX 

PT Novel purified polypeptide comprising fragment of mammalian aspartyl 

PT protease 2, lacking Asp2 transmembrane domain and retaining beta 

PT secretase activity of Asp2 useful for identifying inhibitors of Asp2 

PT activity - 

XX 

PS Claim 98; Fig 2; 185pp; English. 
XX 

CC The invention relates to a novel purified polypeptide comprising a 

CC fragment of mammalian aspartyl protease 2 (Asp2) protein which lacks the 

CC Asp2 transmembrane domain and the Asp2 protein, and where the polypeptide 

CC and the fragment retain the beta-secretase activity of the mammalian Asp2 

CC protein. Also included is an isoform of amyloid protein precursor (APP) 

CC comprising the amino acid sequence of a APP or its fragment containing 

CC an APP cleavage site recognisable by a mammalian beta-secretase, and 

CC further comprising two lysine residues at the carboxyl terminus of the 

CC amino acid sequence of the mammalian APP or APP fragment. The 

CC polypeptides are used for assaying for modulators of beta-secretase 

CC activity; identifying agents that inhibit the APP processing activity 

CC of human Asp2 aspartyl protease (Hu-Asp2); identifying agents that 

CC modulate the activity of Asp2 ; and for reducing cellular production of 

CC amyloid beta (Abeta) from APP. Agents identified by the above methods 

CC are useful for treating Alzheimer 1 s disease; and for identifying 

CC modulators of amyloid-beta (Abeta) peptide production, for use in 

CC designing therapeutics for the treatment or prevention of Alzheimer's 

CC disease. Probes and primers derived from Asp nucleic acid sequences 

CC are useful for detecting Hu-Asp nucleic acids in in vitro assays and in 

CC Northern and Southern blots. The present sequence represents the 

CC coding sequence of human Asp-2a used in the methods of the invention. 

XX 

SQ Sequence 2070 BP; 476 A; 583 C; 562 G; 449 T; 0 other; 

Query Match 100.0%; Score 2070; DB 22; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2 070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I ! I I 
Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 


Qy 

Db 


181 GT GGAGAT GGT GGACAAC CT GAGG G GCAAGT CG G G G C AGGGCT ACT AC GT GGAGAT GAC C 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGT CGGGGCAGGGCTACTACGT GGAGAT GACC 240 


241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

241 GT GGG C AG C CCC C C GCAGAC GCT CAAC AT C CT G GTGGAT ACAGGCAGCAGT AACTT T GC A 300 

3 01 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I t I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

421 CTGGGCACCGACCTGGTT^AGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

4 81 GCT GC CAT C AC T GAAT CAGACAAGTT CT T CAT CAAC GGCT C CAACT GGGAAG GCAT C C T G 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

481 GCT GC CAT CACT GAAT CAGACAAGTTCTT CAT CAAC GGCTC CAACT GGGAAGGC AT CCTG 54 0 

541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

661 CCC CT CAACCAGT CT GAAGT GCT GGCCT CT GT CGGAGGGAGCATGATCATTGGAGGTATC 720 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I 

661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

721 GAC CAC T C GCT GT ACACAGG CAGT CT CT GGT AT AC AC C CAT C CGGC GGGAGT GGT AT TAT 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

781 GAGGT CAT CAT T GT G C GGGT G GAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

7 81 GAGGT CAT CAT T GT G C GGGT G GAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

841 T ACAAC TAT GACAAGAGCAT T GT GGACAGT GG CAC CAC CAAC CT T C GT T T GCC CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

841 T ACAACT AT GACAAGAGCAT T GT G GACAGT GG CAC CAC CAACCTT C GT T T GC C CAAGAAA 900 

901 GT GTT T GAAGCT GCAGT CAAAT C CAT CAAGGC AGCCT C CT C CAC GGAGAAGTT C C CT GAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 108 0 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 108 0 

1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 


I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1081 AT C CT T C C GCAG CAAT AC CT GC GGC C AGT GGAAGAT GT GGCCACGTCCCAAGAC GACT GT 114 0 

Qy 1141 T ACAAGT T T GC CAT CT CAC AGT CAT C C AC GGGC ACT GTT ATGGGAGCT GT TAT C AT GGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 1141 T ACAAGT T T GC CAT CT C ACAGT CAT C CAC GGGC ACT GTT ATGGGAGCT GT TAT CAT GGAG 1200 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 01 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 1261 CAT GT GCACGAT GAGTT CAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACAT G 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1261 CAT GT GCAC GAT GAGT T C AGGAC GGC AGC GGT GGAAGGC CCTTTT GT CACCT T GGACAT G 1320 

Qy 1321 GAAGACT GT GGCT ACAACATT CCACAGACAGAT GAGTCAACCCT CAT GACCAT AGC CTAT 138 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 GAAGACT GT GGCTACAACATT CCACAGACAGAT GAGTCAACCCT CAT GACCAT AGCCT AT 1380 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I I I I I I I I I I I I I ! I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

Qy 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1501 AAGT GAGGAGGCC CAT GGGCAGAAGAT AGAGATT CCCCT GGACCACACCT CCGTGGTT CA 1560 

Qy 15 61 CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCC 162 0 

I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1561 CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCC 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I II I I I I I I I I I I I I I I I I 
Db 1621 C CAC C CAC CAAAT GCCTCTGCCTT GAT GGAGAAGGAAAAGGCT GGCAAGGTGGGTTC CAG 1680 

Qy 1681 GGACT GTACCT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCTGGCGGGAATACT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I 
Db 1681 GGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCTGGC GGGAAT ACT 1740 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

Qy 18 01 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
Db 18 01 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

Qy 1861 GT ACT GGCAT CAC AC GCAG GT T AC CT T GG C GT GT GT C CCT GT GGT ACCCTGGCAGAGAAG 1920 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 18 61 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGAC CAAGCT T GT T T C C C T GC T GGC CAAAGT C AGTAGGAGAGGAT GCACAGTTT GCTAT T 1980 

I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 


Db 1921 AGACCAAG CT T GTTTCCCTG CT GGCCAAAGT C AGT AG GAGAGGAT GCACAGT T T GC TATT 1980 


Qy 19 81 T GC TT T AGAGAC AGGGACT GT ATAAACAAG C CT AACATT G GT GCAAAGATT GC CT CT T GA 2 04 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1981 T G C TT T AGAGAC AGGGACT GT AT AAACAAGC CT AACATT G GT GCAAAGAT TGCCTCTT GA 2040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 
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AAD06739 

ID AAD06739 standard; cDNA; 2070 BP. 
XX 

AC AAD06739; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Human aspartyl protease 2a (Asp2a) cDNA. 
XX 

KW Human; alpha-secretase; amyloid precursor protein; APP; therapy; 

KW Alzheimer's disease; antialzheimer 1 s ; aspartyl protease 2a; Asp 2a; 

KW beta-secretase; chromosome llq23 . 3-24 . 1 ; ss. 
XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 1..1506 

FT /*tag= a 

FT /product^ "Human aspartyl protease 2a" 

FT sig_peptide 1 . . 63 

FT /*tag= b 

FT sig_peptide 64.. 135 

FT /*tag= c 

FT /note= "Pre-pro-peptide" 

FT sig_peptide 136.. 171 

FT /*tag= d 

FT /note= "Pro-peptide" 

FT mat_peptide 172.. 1503 

FT /*tag= e 

FT /product^ "Human mature aspartyl protease 2a" 
XX 

PN WO200123533-A2. 
XX 

PD 05-APR-2001. 
XX 

PF 22-SEP-2000; 2000WO-US26080 . 
XX 

PR 23-SEP-1999; 99US-0155493 . 

PR 23-SEP-1999; 99WO-US20881 . 

PR 13-OCT-1999; 99US-04 16901 . 

PR 06-DEC-1999; 99US-0169232 . 
XX 

PA ( PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Gurney M, Bienkowski MJ; 


DR WPI; 2001-290516/30. 

DR P-PSDB; AAE02581. 
XX 

PT Enzymes that cleave the alpha-secretase site of the amyloid precursor 

PT protein, useful for the treatment of Alzheimer 1 s disease - 

XX 

PS Example 2; Page 126-127; 189pp; English. 
XX 

CC The present invention relates to enzymes for cleaving the alpha- 

CC secretase site of the amyloid precursor protein (APP) and methods of 

CC identifying those enzymes. The methods may be used to identify enzymes 

CC that may be used to cleave the alpha-secretase cleavage site of the APP 

CC protein. The enzymes may be used to treat or modulate the progress of 

CC Alzheimer's disease. The present sequence is human aspartyl protease 

CC (Asp) 2a cDNA. Asp 2a has beta-secretase protease activity. Asp2 gene 

CC is located on chromosome llq23 . 3-24 . 1 . 
XX 

SQ Sequence 2070 BP; 476 A; 583 C; 562 G; 449 T; 0 other; 

Query Match 100.0%; Score 2070; DB 22; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I II I I I I I II I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I I M I II I I I I I I I I I I I I I I I II I I I I I I I I M I II I I I M I I I I I M I I I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

Qy 181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGT CGGGGCAGGGCTACTACGT GGAGAT GACC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGT CGGGGCAGGGCTACTACGT GGAGAT GACC 24 0 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCC7VACGTCACTGTGCGTGCCAACATT 48 0 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 421 CT G GG C ACC GAC CT GGT AAGCAT C C C CC AT G GC C C CAAC GT CACT GT G C GT GC CAACAT T 480 


Qy 


481 GCT GC C AT C AC T GAAT CAGACAAGT T CT T CAT CAAC GGCT C C AACT GGGAAGG C ATC CT G 54 0 


1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 

Db 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I | I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 C C C CT CAACCAGT CT GAAGT GCT GGCCT CT GT C GGAG GG AGC AT GAT CAT T GGAGGT AT C 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCAT GAT CATT GGAGGT AT C 72 0 

Qy 721 GAC CACT CGCT GTACACAG GC AGTCT CT GGT AT ACAC C CAT C C GG C GGGAGT GGT AT TAT 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 721 GAC CACT CGCT GTACAC AGGCAGT CT CT G GT AT ACAC C CAT C C GGC GGGAGT GGT ATT AT 7 80 

Qy 781 GAGGT CAT CAT T GT GC GG GT GGAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAGGAG 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I M 
Db 781 GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT G GAC AGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

Qy 841 T ACAACT AT GACAAGAGC AT T GT GGACAGT GGCAC CAC CAAC CT T C GT T T GC C CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 841 T ACAAC TAT GACAAGAG CAT T GT GGACAGT GGCAC CAC CAACCT T C GT T T GC C CAAGAAA 900 

Qy 901 GT GTT T GAAGCT GCAGT CAAAT C CAT CAAGGC AGC CT C CT CC AC GGAGAAGT T C C CT GAT 960 

I | | | | | | | I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I I I 
Db 901 GT GT T T GAAGCT GCAGT CAAAT C CAT CAAGG C AGC CT C CT CC AC GGAGAAGT T C C CT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 T T C C CAGT CAT CT CACT CT AC C T AAT GGGT GAGGT T AC CAACCAGT C CT T C C GCAT CAC C 108 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II 
Db 1021 T T C C CAGT CAT CT CACT CT AC CTAAT GGGT GAGGT TAC CAACCAGT C CT T C C GCAT CAC C 1080 

Qy 1081 AT C CT T C C GCAGCAATAC CT GC GGC C AGT G GAAGAT GT GGC CAC GT C C C AAGACGACTGT 114 0 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 1081 AT C CT T C C GCAGCAATAC CT GC GGC CAGT GGAAGAT GT GGC CAC GT C C CAAGAC GACTGT 114 0 

Qy 1141 T ACAAGTTT GCC AT CT CAC AGT CAT C C AC GGGCACT GT T AT GGGAGCT GT TAT CAT GGAG 1200 

I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I I I I 
" Db 1141 T ACAAGT T T GC CAT CT CAC AGT CAT C CACGG GCACT GT TAT GGGAGCT GTT AT CAT G GAG 12 00 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I 
Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

Qy 1261 CAT GT G CAC GAT GAGT T C AG GAC GGC AGCGGT GGAAGGC C CTTT T GT CAC CT T GG ACATG 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1261 CAT GT GC AC GAT GAGT T C AGGAC GG C AGCGGT GGAAGGC C CTTT T GT CAC CT T GGACAT G 1320 


Qy 1321 GAAGACT GT GG CTACAAC ATT C C AC AGACAGAT GAGT CAAC C CT CAT GAC CAT AGC CT AT 13 8 0 

I I I I I I I I I I I I I I I I I I I I M I I I I I i I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 


Db 


1321 


GAAGACTGT GGCTACAACATTCCACAGACAGAT GAGTCAACCCT CATGACCATAGCCT AT 


1380 


Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 4 0 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 4 0 

Qy 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGT GAGGAGGC C CAT GG GCAGAAGAT AGAGAT T C C CCT G GAC CACAC CT C C GT GGT T CA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1501 AAGT GAGGAGGC CCAT GGGCAGAAGAT AGAGAT T C C CCT GGAC CAC AC CT C C GT GGT T CA 15 60 

Qy 1561 CT TT GGT CACAAGT AGGAGACAC AGAT G GCAC CT GT GGC CAGAGC AC CT C AGGACCCT C C 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1561 CT T T GGT CACAAGT AGGAGACAC AGAT G GCAC CT GT GGC CAGAGCAC CT C AGGAC CCT C C 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

Qy 1681 GGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGC GGGAAT ACT 17 4 0 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1681 GGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGC G G GAAT ACT 17 40 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAG7\A 18 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I 

Db 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 192 0 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M 
Db 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 19 80 

I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 1921 AGAC CAAGCT T GTT T C C CT GCT GGC CAAAGT C AGT AGGAGAGGAT GC ACAGT T T GCT AT T 19 8 0 

Qy 1981 T GCT T TAGAGAC AG GGACT GT ATAAACAAGC CTAACATT GGT G CAAAGATT GC CT CT T GA 2 04 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1981 TGCTTTAGAGACAGGGACTGT ATAAACAAGC CTAACATT GGT GCAAAGATTGCCTCTTGA 204 0 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I ! I I M I I I I I I I I I I I I I I I I II I I I I M 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 7 
ABL52457 

ID ABL52457 standard; cDNA; 2070 BP. 
XX 

AC ABL52457; 
XX 


DT 16-JUL-2002 (first entry) 
XX 

DE Human Asp-2 (a) nucleotide sequence SEQ ID NO: 3. 
XX 

KW Human; Asp-1; Asp-2; aspartyl protease; enzyme; Alzheimer's disease; 

KW proteolytic; chromosome llq23 . 3-24 . 1; gene; ss. 

XX 

OS Homo sapiens . 
XX . 

FH Key Location/Qualifiers 

FT CDS 1..1506 

FT /*tag= a 

FT /product^ "Asp-2 (a) " 

FT /note= "aspartyl protease" 

XX 

PN GB2367060-A. 
XX 

PD 27-MAR-2002. 
XX 

PF 29-OCT-2001; 2001GB-0025934 . 
XX 

PR 23-SEP-1999; 99US-155493P. 

PR 23-SEP-1999; 99US-0404133 . 

PR 23-SEP-1999; 99WO-US2 0881 . 

PR 13-OCT-1999; 99US-0416901 . 

PR 06-DEC-1999; 99US-169232P . 

PR 22-SEP-2000; 2000GB-0023315 . 
XX 

PA ( PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Bienkowkski MJ, Gurney M; 
XX 

DR WPI; 2002-396337/43. 

DR P.-PSDB; ABB78590. 
XX 

PT Human aspartyl protease 1 substrates useful in assays to detect 

PT aspartyl protease activity, e.g. for the diagnosis of Alzheimer 1 s 

PT disease - 
XX 

PS Example 2; Fig 2; 182pp; English. 
XX 

CC The present invention describes a human aspartyl protease 1 (hu-Aspl) 

CC substrate (I) which comprises a peptide of no more than 50 amino acids, 

CC and which comprises the 8 amino acid sequence Gly-Leu-Ala-Leu-Ala-Leu- 

CC Glu-Pro. Also described are: (1) a method (II) for assaying hu-Aspl 

CC proteolytic activity, comprising: (a) contacting a hu-Aspl protein with 

CC (I) under acidic conditions; and (b) determining the level of hu-Aspl 

CC proteolytic activity; (2) a purified polynucleotide (III) comprising a 

CC nucleotide sequence that hybridises under stringent conditions to the 

CC non-coding strand complementary to a defined 1804 nucleotide sequence 

CC (see ABL52456) where the nucleotide sequence encodes a polypeptide having 

CC Aspl proteolytic activity and lacks nucleotides encoding a transmembrane 

CC domain); (3) a purified polynucleotide (III 1 ) comprising a sequence that 

CC hybridises under stringent conditions to (III) (the nucleotide sequence 

CC encodes a polypeptide further lacking a pro-peptide domain corresponding 

CC to amino acids 23-62 of hu-Aspl {see ABB78589)); (4) a vector (IV) 

CC comprising (III) or (III 1 ); and (5) a host cell (V) transformed or 


CC transfecteci with (III), (III') and/or (IV). The hu-Aspl protease 

CC substrate (I) may be used as an enzyme substrate in assays to detect 

CC aspartyl protease activity, (II) and therefore diagnose diseases 

CC associated with aberrant hu-Aspl expression and activity such as 

CC Alzheimer's disease. Hu-Aspl has been localised to chromosome 21, while 

CC hu-Asp2 has been localised to chromosome llq23 . 3-24 . 1 . The present 

CC sequence encodes hu-Asp2(a) from the present invention. 

XX 

SQ Sequence 2070 BP; 476 A; 583 C; 562 G; 449 T; 0 other; 

Query Match 100.0%; Score 2070; DB 24; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I M I I I I I i I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Qy 181 GT GGAGAT GGTGGACAACCT GAGGGGCAAGTCGGGGCAGGGCT ACT ACGT GGAGAT GACC 24 0 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 GT GGAGAT GGT GGACAAC CT GAGG GGCAAGT C G G GGCAGG G CT ACT AC GT GGAGAT GACC 24 0 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

II I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

Qy 421 C T GGGC AC C GAC CT GGT AAGCAT C CC C CAT GG C C C CAAC GT C ACT GT GC GT GC CAAC ATT 4 80 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I 
Db 421 CT GGG CAC C GAC CT GGT AAGCAT C CC C CAT GGC CC CAAC GT C ACT GT GC GT G C CAAC ATT 480 

Qy 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 


Qy 

Db 


601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I 

601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 


Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I 

Db 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 720 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 80 

Qy 781 GAGGT CAT CAT T GT G C GGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 781 GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT GGACAGGATCT GAAAAT G GACT G CAAGGAG 840 

Qy 841 T ACAAC TAT GACAAGAG C ATT GT GGAC AGT G GCAC C AC CAAC CT T C GTT T GC CCAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 T ACAACT AT GACAAGAGC AT T GT GGAC AGT G GC AC CAC CAAC CT T C GTTT GC C CAAGAAA 900 

Qy 901 GT GTT T GAAGC T GC AGT CAAAT C CAT CAAGGC AG C CT C CT C CAC GGAGAAGT T C C CT GAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 GT GTTT GAAGCT GC AGT CAAAT C CAT CAAGGCAGC CT C CT C CAC GGAGAAGT T C C CT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I I 
Db 961 GGT T T CT G GCTAGGAGAGCAGC T G GT GTGCTGGCAAGCAG GCAC CAC CCCTTGGAAC ATT 1020 

Qy 1021 TT C C CAGT CAT CTC ACT CT AC CT AAT GGGT GAGGT T AC CAAC C AGT CCT T CC GC AT CAC C 1080 

1 I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I 
Db 1021 TT C C CAGT CAT CTC ACT CT AC CTAATGG GT GAGGT T AC CAAC CAGT C CT T CC GC AT CAC C 1080 

Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 114 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I 
Db 1081 AT C CT T CC G CAGCAAT AC CT GC GGC CAGT GGAAGAT GT GGC CAC GT C C CAAGAC GACT GT 1140 

Qy 1141 TACAAGTTT GC CAT CT CACAGT CAT CCACGGGCACTGTTATGGGAGCT GTTAT CAT GGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I 
Db 1141 TACAAGTTTGCCAT CT CACAGT CAT CCACGGGCACTGTTATGGGAGCT GTTAT CAT GGAG 1200 

Qy 12 01 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 12 61 CAT GT GCAC GAT GAGT T C AGGAC GGC AGC GGT GGAAGGCC CT T T T GT CAC CT T G GAC AT G 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 61 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

Qy 1321 GAAGACT GT GGCT ACAAC AT T C CAC AGACAGAT GAGT CAAC CCT CAT GAC CAT AGC CT AT 1380 

I I I I I II I I II I I I I I I II I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I I I I 

Db 1321 GAAGACTGT GGCT ACAAC AT T C CAC AGACAGAT GAGT CAAC CCT CAT GAC CAT AGC CT AT 1380 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

Qy 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 


Qy 

Db 


15 01 AAGT GAGGAGGC CC AT GGGC AGAAGAT AGAGAT T C C C CT G GAC C AC AC CT CC GT GGTT C A 1560 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I 
1501 AAGT GAGGAGGC CC AT GGG C AGAAGAT AGAGAT T C C C CT GGAC CAC AC CT C C GT G GTT C A 1560 


Qy 1561 CT T T GGT CACAAGT AGGAGACAC AGAT GGCACCT GT G GC C AGAGC AC CT CAG GAC C CT C C 1620 

I I I I i I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 1561 CT T T GGT CACAAGT AGGAGACACAGAT GGCACCT GT GGC C AGAGC AC CT CAGGAC C CT C C 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I I I I I I II I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

Qy 1681 GGACT GTACCT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACTCT GCTGGCGGGAATACT 174 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 GGACT GT AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAGCACT CT GCT GGCGG GAAT ACT 1740 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

Qy 18 01 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I II I I I I I II I I I I I I I I I I I I 

Db 18 01 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

Qy 18 61 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 192 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 18 61 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGAC CAAGCT T GTT T CCCTGCTGGC CAAAGT C AGT AG GAGAGGAT GC ACAGTT T GCT AT T 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I M M 
Db 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

Qy 1981 T GCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATT GGT GCAAAGATT GC CT CTT GA 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 1981 T GCTTTAGAGACAGGGACT GTATAAACAAGCCTAACATT GGT GCAAAGATT GC CT CTT GA 2 040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 8 
ABL49914 

ID ABL49914 standard; DNA; 2070 BP. 
XX 

AC ABL49914; 
XX 

DT 31-MAY-2002 {first entry) 
XX 

DE Human aspartyl protease nucleotide sequence SEQ ID NO:l. 
XX 

KW Beta-secretase ; enzyme; cleavage site; amyloid protein precursor; APP; 

KW aspartyl protease; neuroprotective; nootropic; beta-secretase inhibitor; 

KW Alzheimer's disease; gene; ds . 
XX 

OS Homo sapiens. 
XX 


PN WO200206306-A2. 
XX 

PD 24-JAN-2002. 
XX 

PF 19-JUL-2001; 2001WO-US23035 . 
XX 

PR 19-JUL-2000; 2 000US-2 197 95P . 

PR 12-MAR-2001; 2001US-275251P . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Yan R, Tomasselli AG, Gurney ME, Eirunons TL, Bienkowski MJ; 

PI Heinrikson RL; 

XX 

DR WPI; 2002-216995/27. 

DR P-PSDB; ABB06409. 
XX 

PT Novel substrates for human aspartyl protease useful for identifying 

PT modulators of beta secretase activity of aspartyl protease for treating 

PT Alzheimer's disease 

XX 

PS Claim 1; Page 117; 188pp; English. 
XX 

CC The present invention describes an isolated peptide (I) comprising a 

CC sequence of at least four amino acids, where the peptide is a substrate 

CC for conducting aspartyl protease assays. (I) has neuroprotective and 

CC nootropic activities, and can be used as an inhibitor of beta-secretase 

CC activity. A beta-secretase modulator from the present invention can be 

CC used for inhibiting beta-secretase activity in vivo, and in the 

CC manufacture of a medicament for the treatment of Alzheimer's disease. 

CC Pharmaceutical compositions from the present invention can be used for 

CC treating a disease or condition characterised by an abnormal beta- 

CC secretase activity. (I) is useful for identifying agents that modulate 

CC the activity of human Asp2 aspartyl protease (Hu-Asp2). (I) is useful 

CC as a core structure to construct derivatives. ABL49914 to ABL49925 and 

CC ABB06409 to ABB06593 represent sequences used in the exemplification 

CC of the present invention. 

XX 

SQ Sequence 2070 BP; 476 A; 583 C; 562 G; 449 T; 0 other; 

Query Match 100.0%; Score 2070; DB 24; Length 2070; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2 070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I II I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 


Qy 


181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 240 


Db 

1 O 1 

181 

Kj X (jC?AGA1 CjCj i \j bALAAL L. 1 ^J\^Kj\j\D^I\t\\j 1 1/W_.Vj 1 oorWj-r\J- ^ nv --^ 

240 

Qy 

241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 

300 


i i i i i t i i i i i i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I t 1 1 1 1 1 1 1 1 1 1 1 1 

I I I I I j 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 I 1 1 M 1 1 1 1 i 1 I 1 1 1 1 1 1 1 1 1 H I I 1 I 


Db 

241 

Qt TGGGCAGCCCCCCIjCACjACvjU 1 UAA^Al L-L, 1 bul bbAlALAbotiibLAbl/viL 1 1 1 kjWl 

300 

Qy 

301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 

360 


i t i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 1 1 1 1 [ 1 1 
1 M 1 1 1 M 1 M 1 1 1 I 1 II 1 1 1 1 1 1 II 1 M 1 1 1 1 1 II 1 1 1 1 1 11 II 1 1 II II 1 II 11 1 IN 


Db 

301 

GTGGGTGCTGCCCCCCACCCC1 1CC1GCA1 CGCIACIACCAGAGGCACjCKjI LLAbLAUA 


Qy 

361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 

420 


i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t I t I I i I t I I l l l l l 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IE 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

361 

__ ^ „ „ „ „ -f. s~t s-*rn /~* /"< s~* t\ tv /^/^/^m/^m/^rTiTvrp/^rp/^/^^j^mTV 7\ /"* /"* A r* C* C r*' 1\ A r^T 1 r~" i"" r"* A A i~* iT"" 1 7\ f"* 

TACCGGGACCTCCGGAAGGGTGTGTATGI GCCC 1 ACACCCAGGGCAAG1 Cj(j(jAACj(j(jIjAIj 

a 9 n 

fi z u 

Qy 

421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 

480 


i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i 1 1 1 1 1 1 1 
II I I 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

421 

CTGGGCACCGACCTGGTAAGCATCCCCCA1 GGCCCCAACGI LALlblbLbl bbbAAbAl 1 

fl o u 

Qy 

481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGaAGGCATCCTG 

540 


i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i t 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 i 1 1 1 1 1 


Db 

481 

GCTGCCATCACTGAATCAGACAAGTTCTI CAI CAACGGCl LLAAbl bbbAAbbbAl L^lb 

D *± U 

Qy 

541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 

600 


1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 { 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 


Db 

541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGAC1 CCC1 GGAGCC1 11L11 IbALlLl 

DUU 

Qy 

601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 

660 


> 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 

I | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

601 

CTGGTAAAGCAGACCCACGTTCCCAACC1 CI 1 CI bUAUUl 1 Iblbblbbibbbl l u 

DDU 

Qy 

661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 

720 


i i i t i i i i i i i i i t I I I I I 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 t 1 ! 1 1 1 1 1 1 1 t 1 1 1 1 1 

1 1 1 M 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I IE 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 


Db 

661 

/-i^/^^irnnTi tv r~* r*~r<< /-*m /~*TT\f~>~T\ 7v /^m/^f^m/^/^r^r^rp/^rpy^rTi,~'/"/~ , 7\ ^/^/"A^r^AT^^AT 1 ^AT n T n i^i^A^^'T'A , T 1 ^ 

CCCCTCAACCAGTCTGAAGTGCTGGCCrCl Gl CGGAGGCjACjCAI (jAI CAI 1 1 Al 

79 0 

Qy 

721 

GACCACT C G CT GT AC AC AG GCAGT CT CT GGT AT ACAC C CAT C C GGC G GGAGT GGT AT TAT 

780 


i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I 
I | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 


Db 

721 

GACCACTCGC I G 1 ACACAGGCAG 1 C 1 C 1 GG 1 Al ACACCUAl \j\d\dP&3 1 \j\d 1 Al 1 Al 

7 9. n 

Qy 

781 

GAGGT CAT CAT T GT GCGGGT G GAGAT C AAT GGAC AG GAT CT GAAAAT GGACT GCAAGGAG 

840 


1 1 1 1 1 1 1 1 1 1 M 1 1 t 1 1 1 i 1 1 1 1 1 1 1 1 1 I l l l l l l l l l 1 1 1 1 1 1 1 I 1 II 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 M 1 I I 1 t II 1 1 1 M 1 1 1 1 1 1 1 i 1 1 1 1 M 1 t 1 1 1 1 1 1 II 


Db 

781 

GAGGT CAI CAI lol (jtCVjCjIjI (jtIjACjAI LAA1 buALAbbAl bl vj/\iA/A/\l 

R 4 0 

Qy 

841 

TAC AACT AT GACAAGAGC AT T GT GGACAGT GGCAC C AC C AACCT T C GTT T GCC CAAGAAA 

900 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 


Db 

841 

tacaactatgacaagagcattgtggacagtggcaccaccaacctt'cgtttgcccaagaaa 

900 

Qy 

901 

GT GTT T GAAGCT GCAGT CAAAT C CAT C AAGGC AGC CT C CT CC AC GGAGAAGTT C C CT GAT 

960 



1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

901 

GT GTT T GAAG CT GCAGT CAAAT C CAT C AAGGC AGC C T C CT C CAC GGAGAAGT TC C CT GAT 

960 

Qy 

961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 

1020 

Db 

961 

I I 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 

1020 


Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I 


1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 108 0 

1081 AT C CT T C C GCAGCAAT AC CT GC GG C C AGT G GAAGAT GT GGC CAC GT CC CAAGACGACT GT 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1081 AT C CT T C C GCAGCAAT AC CT GC GG C C AGT GGAAGAT GT G GC CAC GT CC CAAGACGACT GT 1140 

1141 T ACAAGT T T GC C AT CT CAC AGT CAT CC AC GGGC AC T GT TAT GG GAGCT GTT AT CAT GGAG 12 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
1141 T ACAAGT T T GC C AT CT CAC AGT CAT CCAC GGGCAC T GT TAT GGGAGCT GT TAT CAT G GAG 12 00 

1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

12 61 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 132 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I 

1261 CAT GT GCAC GAT GAGT T CAG GAC GGC AGC GGT GGAAGG C CC TT TT GTCAC CT T GGACAT G 1320 

1321 GAAGACT GT GGCTACAACAT T CCAC AGACAGAT GAGT CAAC CCT C AT GAC CAT AGC CT AT 138 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1321 GAAGACT GTGGCTACAACATT CCACAGACAGAT GAGT CAACCCTCATGACCATAGCCT AT 138 0 

1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I ! I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

1501 AAGT GAGGAGGCC C AT GGGCAGAAGAT AGAGAT T C C C C T GGAC CAC AC CT C C GT GGT T C A 1560 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1501 AAGT GAGGAGGCC CAT GGGCAGAAGAT AGAGATT C C C C T GGAC C AC AC CT C C GT GGT T C A 1560 

1561 CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCC 162 0 

I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I I I I I I I 
1561 CT T T GGT C ACAAGT AGGAGACACAGAT GGCAC CT GT GGC CAGAGCACCT CAGGAC C CT C C 1620 

1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I M I I I 

1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 

1681 GGACT GTAC C T GT AGGAAAC AGAAAAGAGAAGAAAGAAGCACT CT GCT G GC GGGAAT ACT 174 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
1681 GGACT GTAC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAG CACT CT GCT GGC GGGAAT ACT 1740 

1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

I I I II I I I I I I I I I I I I I I I I I I I I I E I I I I 1 II I I I I I I I I I I I I I I II I I I I I I I I I I 
1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1801 TT GT CC AC CAT T C CTT T AAATT CT C CAAC C CAAAGT AT T CT T CT T T T CT T AGT TT CAGAA 1860 

1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 192 0 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I 
1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 


Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I > I I I I I I 
Db 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 


Qy 


Db 


1981 T GCT TT AGAGACAGGGACT GT ATAAACAAG C CT AAC AT T GGT G CAAAGAT TGCCTCTT GA 2040 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
1981 T GCTTT AGAGACAGGGACT GT AT AAACAAG C CT AAC AT T GGT G CAAAGAT TGCCTCTT GA 2040 


QY 


2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 



Db 


2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 9 

AAA59553 

ID 

AAA59553 standard; DNA; 16080 BP. 

XX 


AC 

AAA59553; 

XX 


DT 

14-NOV-2000 (first entry) 

XX 


DE 

DNA clone pCEK CI. 27 encoding a human beta-secretase enzyme. 

XX 


KW 

Beta-secretase; beta-amyloid precursor protein; beta-amyloid peptide; 

KW 

amyloid plaque component; Alzheimer f s disease; amyloidogenic disease; 

KW 

inhibitor; ss. 

XX 


OS 

Homo sapiens . 

XX 


PN 

WO200047618-A2. 

XX 


PD 

17-AUG-2000. 

XX 


PF 

10-FEB-2000; 2 000WO-US03819 . 

XX 


PR 

10-FEB-1999; 99US-01 19571 . 

PR 

15-JUN-1999; 99US-0139172 . 

XX 


PA 

(ELAN-) ELAN PHARM INC. 

XX 


PI 

Anderson JP, Basi G, Doane MT, Frigon N, John V, Power M; 

PI 

Sinha S, Tatsuno G, Tung J, Wang S, McConlogue L; 

XX 


DR 

WPI; 2000-533011/48. 

XX 


PT 

Purified beta-secretase protein used in assays to discover inhibitors 

PT 

which can be used for the treatment of amyloidogenic diseases e.g. 

PT 

Alzheimer's disease - 

XX 


PS 

Disclosure; Fig 13A-E; 121pp; English. 

XX 


cc 

The specification describes a beta-secretase enzyme. The enzyme cleaves 

cc 

beta-amyloid precursor protein to produce beta-amyloid peptide. This 

cc 

enzyme is therefore implicated in the production of amyloid plaque 

cc 

components which accumulate in the brains of individuals afflicted with 

cc 

Alzheimer's disease. Inhibitors of beta-secretase are administered to 


CC a mammalian subject e.g. with Alzheimer's disease or Alzheimer's 

CC disease-like pathology to test if they maintain or improve cognitive 

CC ability or reduce the plaque burden. The compounds are used for the 

CC treatment of amyloidogenic diseases e.g. Alzheimer's disease. The 

CC present sequence encodes a human beta-secretase enzyme. 

XX 

SQ Sequence 16080 BP; 3627 A; 4556 C; 3962 G; 3913 T; 22 other; 


Query Match 98.9%; Score 2047.8; DB 21; Length 16080; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 2052; Conservative 0; Mismatches 7; Indels 0; Gaps 0; 


Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db • 1675 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 1734 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 1735 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 17 94 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 17 95 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 1854 


Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1855 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 1914 


Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 1915 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 1974 


Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I 

Db 1975 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 2034 


Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2035 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 2094 


Qy 421 CT GGGC AC C GAC CT G GT AAGC AT C CC C CAT GG C C C CAAC GT C ACT GT GC GT GC CAACAT T 480 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2095 CT GGGC AC C GAC CT G GTAAGCAT C CC C CAT GG C C C CAAC GT C ACT GT GC GT GC CAAC AT T 2154 


Qy 4 81 GCT GC C AT CACT GAAT- CAGACAAGTT CT T CAT CAAC GGCT C CAAC T G GGAAGGCAT C CT G 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2155 GCT GC C AT CACT GAAT CAGACAAGTT CT T CAT CAAC GGCT C CAAC T G GGAAG GCAT C C T G 2214 


Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2215 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 2274 


Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 2275 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 2334 


Qy 


661 


C C C C T CAAC CAGT CT GAAGT GCT GGC CT CT GT C G GAGGGAGC AT GAT CAT T GGAGGT AT C 720 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 

Db 2335 C C C CT CAAC C AGT CT GAAGT GCTGGCCTCT GT CGGAGGGAGCAT GAT CAT T G GAG GT AT C 2394 

Qy 721 GAC CACT C G CT GT AC ACAGGC AGT CT CT GGT AT ACAC C CAT C C GGC GGGAGT GGT AT TAT 78 0 

I II I I I I I I M I I II I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I M I 

Db 2395 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 2454 

Qy 781 GAG GT CAT CATT GT G C GGGT GGAGAT CAAT GGAC AG GAT CT GAAAAT GGACT GCAAG GAG 84 0 

I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2455 GAGGT CAT CATT GT G C GGGT GGAGAT CAAT GGACAG GAT CT GAAAAT GGACT GCAAG GAG 2514 

Qy 841 T ACAACT AT GACAAGAGC AT T GT GGAC AGT GGCAC C AC CAAC CT T C GT T T GC C CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2515 T ACAACT AT GACAAGAGC AT T GT G GACAGT GG CAC C AC CAAC CTTCGTTTGCC CAAGAAA 2574 

Qy 901 GT GTTT GAAGCT GC AGT CAAAT C CAT CAAGGCAG C CT C C TC CAC GGAGAAGT T C C CT GAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2575 GT GTT T GAAGCT GC AGT CAAAT C CAT CAAGGC AGC CT C CTC CAC GGAGAAGT T CC CT GAT 2 634 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 2635 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 2694 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 10 8 0 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I 
Db 2 695 T T C CC AGT CAT CT CACT CT AC CT AAT GGGT GAGGT T AC CAAC C AGT CCT T C C G CAT CAC C 27 54 

Qy 1081 AT C CT T C C GC AGCAAT AC CT GC GGC CAGT GGAAGAT GT GGC CAC GT CC CAAGACGACT GT 114 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I I I II I I I I I I I I I I I I I 
Db 2755 AT C CT T C C G C AGCAAT AC CT GC GGC CAGT G GAAGAT GT GGC CAC GT CC CAAGACGACT GT 2814 

Qy 1141 T ACAAGT T T GC CAT CT CAC AGT C AT CCAC G GGCACT GT TAT GGGAGCT GTT AT CAT G GAG 12 00 

I I I I I I II II I I I I I I I II I 1 j I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 
Db 2 815 T ACAAGT T T GC CAT CT CAC AGT CAT CCAC GC G CACT GT TAT G GGAG CT GTT AT CAT G GAG 2874 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 2 875 GCCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 2934 

Qy 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

I I I M I I I I I I I I I II I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2935 CAT GT GCAC GAT GAGT T C AGGAC GGC AGC GGT GGAAGGC CCT T T T GT CAC CT T GGAC AT G 2994 

Qy 1321 GAAGACT GT GGCT ACAAC AT T CC ACAGACAGAT GAGT CAAC C CT CAT GAC CAT AGC C TAT 1380 

I 1 I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 2 995 GAAGACT GT GGCTACAACATTCCACAGACAGATGAGTCAACCCT CATGACCATAGCCT AT 3054 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3055 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 3114 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 3115 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 3174 

Qy 1501 AAGT GAG GAGGCC C AT GGGCAGAAGAT AGAGATT C C C CT GGACC ACACCT C C GT GGT T CA 1560 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill 


Db 


3175 


AAGT GAGGAGGC CCAT GGGCAGAAGAT AGAGATT CCCCT GGACCACAC CTCCGT CGTT CA 3234 


Qy 1561 CTT T GGT CACAAGT AG GAGAC AC AGAT GGC AC CT GT GG C CAGAGCAC C T CAGGAC C CT C C 162 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3235 CTT T GGT CACAAGT AGGAGAC AC AGAT GG C AC CT GT G GCC AGAGCAC C T CAGGAC C CT C C 3294 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 32 95 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 3354 

Qy 1681 GGACT GTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCT GCT GGCGGGAATACT 174 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 3355 GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCT GGCGGGAATACT 3414 

Qy 17 41 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 3415 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 3474 

Qy 18 01 TTGTCCACCATTCCTTTA7UVTTCTCC7^CCC7U\AGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 1 I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3475 TTGTCCACCATTCCTTTAAATTCTCCAACCCTVAAGTATTCTTCTTTTCTTAGTTTCAGAA 3534 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I III I I I I I I I I 
Db 3535 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 3594 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 3595 AGAC CAAG CTT GT T T C C CT GCT GGC CAAAGT C AGT AGGAGAGGAT GC AC AGT T T GCT ATT 3654 

Qy 1981 T GCT T T AGAGAC AGGGACTGT AT AAACAAGC C T AAC AT T GGT GCAAAGAT T GC CT CTT GA 2040 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 3655 T GCT T T AGAGAC AGGGACT GT AT AAACAAGC C T AAC AT T GGT GCAAAGAT T GC CT CTT GA 3714 

Qy 2041 ATTAAAAAAAAAAAAAAAA 2059 

I I I I I I I I I I I I I I I I 
Db 3715 ATTAAAAAAAAAAACTAGA 3733 


RESULT 10 
AAA59551 

ID AAA59551 standard; DNA; 2348 BP. 
XX 

AC AAA59551; 
XX 

DT 14-NOV-2000 (first entry) 
XX 

DE DNA encoding a human beta-secretase enzyme. 
XX 

KW Beta-secretase; beta-amyloid precursor protein; beta-amyloid peptide; 

KW amyloid plaque component; Alzheimer's disease; amyloidogenic disease; 

KW inhibitor; ss. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualif iers 

FT CDS 306.. 1811 


FT /*tag= a 

FT /product^ "beta-secretase" 
XX 

PN WO200047618-A2 . 
XX 

PD 17-AUG-2000. 
XX 

PF 10-FEB-2000; 2000WO-US03819 . 
XX 

PR 10-FEB-1999; 99US-0119571 . 

PR 15-JUN-1999; 99US-0139172 . 
XX 

PA (ELAN- ) ELAN PHARM INC. 
XX 

PI Anderson JP, Basi G, Doane MT, Frigon N, John V, Power M; 

PI Sinha S, Tatsuno G, Tung J, Wang S, McConlogue L; 

XX 

DR WPI; 2000-533011/48. 

DR P-PSDB; AAB07896. 
XX 

PT Purified beta-secretase protein used in assays to discover inhibitors 

PT which can be used for the treatment of amyloidogenic diseases e.g. 

PT Alzheimer's disease - 
XX 

PS Disclosure; Fig IB; 121pp; English. 

XX 

CC The specification describes a beta-secretase enzyme. The enzyme cleaves 

CC beta-amyloid precursor protein to produce beta-amyloid peptide. This 

CC enzyme is therefore implicated in the production of amyloid plaque 

CC components which accumulate in the brains of individuals afflicted with 

CC Alzheimer's disease. Inhibitors of beta-secretase are administered to 

CC a mammalian subject e.g. with Alzheimer's disease or Alzheimer's 

CC disease-like pathology to test if they maintain or improve cognitive 

CC ability or reduce the plaque burden. The compounds are used for the 

CC treatment of amyloidogenic diseases e.g. Alzheimer's disease. The 

CC present sequence encodes a human beta-secretase enzyme. 

XX 

SQ Sequence 2348 BP; 489 A; 713 C; 661 G; 484 T; 1 other; 

Query Match 98.5%; Score 2039.8; DB 21; Length 2348; 
Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2041; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I [ I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 306 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 365 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 366 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 425 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

Db 426 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 485 

Qy 181 GTGGAGAT GGT GGACAAC CT GAGGGGCAAGT CGGGGCAGGGCT ACT ACGT GGAGAT GACC 24 0 
I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I 


Db 


486 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 54 5 


Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I 

Db 546 GT GG GC AG C C CCC C GCAGAC GCT CAAC AT C CT G GT GGAT AC AGG C AGCAGTAACT TT GC A 605 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 606 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 665 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I M I I I I I I ! I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I 

Db 666 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 725 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 726 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 7 85 

Qy 481 GCT GC C AT C ACT GAAT C AGACAAGTT CT T CAT CAAC GGCT C CAAC T GGGAAG GCAT C CT G 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 786 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 845 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 846 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 905 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 906 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 965 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I 

Db 966 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 1025 

Qy 721 GAC CACT C GCT GT ACAC AGGCAGT CT CT GGT AT ACAC C CAT C CGGC GGGAGT G GT ATT AT 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 1026 GAC CACT C G CTGT AC AC AGGCAGT CT CTGGT ATAC AC CCAT C CGGC G GGAGT GGT ATT AT 1085 

Qy 781 GAGGT CATCATTGTGCGGGT GGAGATCAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 

Db 108 6 GAGGT GAT CATTGTGCGGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 1145 

Qy 841 T ACAACT AT GACAAGAGCAT T GT GGACAGT G GCACCAC CAAC CTT C GT T T GC C CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 

Db 1146 T ACAACT AT GACAAGAGCAT T GT GGACAGT GGC AC CACCAAC CTTCGTTTGCC CAAGAAA 1205 

Qy 901 GT GT T T GAAGCT GC AGT CAAAT C CAT C AAGGC AGC CT C CT C C ACGGAGAAGT T C C CT GAT 960 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1206 GT GT T T GAAGC T GCAGT CAAAT C CAT CAAGG C AGC C T C CT C CAC GGAGAAGT T C C CT GAT 12 65 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

I | | | I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1266 GGTTTCT GGCT AGGAGAGCAGCT GGT GTGCTGGCAAGCAGGCAC CAC CCCTTGGAACATT 1325 

Qy 1021 T T C C CAGT C AT CTCACT CT AC CTAAT GGGT GAGGTT AC CAAC CAGT C CT T CC GCAT CACC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 1326 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1385 


Qy 1081 AT C CT T C C GCAGCAAT AC CT G C GG CCAGT GGAAGAT GT GG C CAC GT C C CAAGAC GACTGT 114 0 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I 
Db 138 6 AT C CT T C C GCAGCAAT AC CT G C GG CCAGT GGAAGAT GT GGC CAC GT C C CAAGAC GACT GT 144 5 

Qy 1141 T ACAAGT T T GC CAT CT C ACAGT C AT CC AC G GGC ACT GT TAT GGGAG C T GTT AT CAT GGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 144 6 T ACAAGT T T GC C AT CT C ACAGT CAT CC AC G GGC ACT GT TAT GGGAG CT GTT AT CAT GGAG 1505 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I II 

Db 1506 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1565 

Qy 1261 CAT GT GC ACGAT GAGTT C AGGAC GGCAGC GGT GGAAGGC C CTT T T GT CACC T T G GAC AT G 1320 

I I I I I I I I I I I I I I I I I I II I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 1566 CAT GT GC ACGAT GAGT T C AGGAC G GCAGC GGT GGAAGGC C CTT T T GT CACC T T GGAC AT G 162 5 

Qy 1321 GAAGACT GTGGCTACAACATT CCACAGACAGATGAGT CAAC CCT CAT GACCAT AGCCTAT 138 0 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 162 6 GAAGACT GTGGCTACAACATT CCACAGACAGAT GAGT CAAC CCT CAT GACCAT AGCCTAT 1685 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 1686 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1745 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 174 6 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1805 

Qy 1501 AAGT GAG GAGGC C CAT GGGCAGAAGAT AGAGAT T C C C CT GGACC AC AC CT CC GT GGTT CA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 18 06 AAGT GAGGAGGCCCAT GGGCAGAAGAT AGAGATTCCCCT GGAC CACACCTCC GT GGTT CA 18 65 

Qy 1561 CT T T GGT C ACAAGT AGGAGAC AC AGAT GGCAC CTGT GGC CAGAGC AC CT CAGGAC C CT C C 1620 

I I I I I I I I II I II I I I I I II I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I M I 

Db 1866 CT T T GGT C ACAAGT AGGAGAC AC AGAT GGC AC CT GT GGC CAGAGC AC CT CAGGAC C CT CC 1925 

Qy 1621 C CAC C CAC CAAAT G C CT CT GC CTT GAT G GAGAAGGAAAAGGCT G GCAAGGT GGGT T C C AG 168 0 

I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 192 6 C CAC C CAC CAAAT GCCTCTGC CTT GAT GGAGAAGGAAAAGGCTGGCAAGGTG GGTT CC AG 1985 

Qy 1681 GGACT GT AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAGCACT CT G CT GGCGGGAAT ACT 174 0 

M | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I 
Db 198 6 G GACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAG CACT CT GCT GGCGGGAAT ACT 2 045 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I | | | | I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I 

Db 2 04 6 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 2105 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCC7^ACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I I I I I I I I I I [ I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I II 

Db 2106 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGT^A 2165 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I M I I I I I I I I I 
Db 2166 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 2225 


Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

Db 2226 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 2285 

Qy 1981 T GCT T T AGAGACAGGGACT GT AT AAACAAG C CTAAC ATTG GT GCAAAGAT TGCCTCTT GA 2 04 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II 
Db 228 6 T GCT T T AGAGACAGGGACT GT AT AAACAAGC CTAACAT T G GT G CAAAGAT TGCCTCTT GA 234 5 

Qy 2041 ATT 2043 

I I I 

Db 2346 ATT 2348 


RESULT 11 
AAV41696 

ID AAV41696 standard; cDNA; 2541 BP. 
XX 

AC AAV41696; 
XX 

DT 26-OCT-1998 (first entry) 
XX 

DE Nucleotide sequence of human ASP2 (aspartic protease 2). 
XX 

KW Human; ASP2; aspartic protease 2; agonist; antagonist; immunospecif ic; 

KW antibody; inhibition; Alzheimer ! s disease; cancer; proteinase; 

KW prohormone processing; ss. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..1506 

FT /*tag= a 

FT /product^ "human ASP2 " 

XX 

PN EP855444-A2. 
XX 

PD 29-JUL-1998. 
XX 

PF 27-JAN-1998; 98EP-03 00573 . 
XX 

PR 28-JAN-1997; 97GB-0001684 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 

PA (SMIK ) SMITHKLINE BEECHAM PLC . 
XX 

PI Chapman CG, Murphy K, Powell DJ, Smith TS; 
XX 

DR WPI; 1998-389809/34. 

DR P-PSDB; AAW59807. 
XX 

PT New nucleic acid encoding human aspartic protease 2 - used to treat, 

PT prevent and diagnose e.g. Alzheimer's disease, cancer and prohormone 

PT processing 
XX 

PS Claim 2; Pages 6-7; 26pp; English. 
XX 

CC This is the nucleotide sequence of the human ASP2 (aspartic protease 2) , 


CC used in the method of the invention. Agonists and antagonists for 

CC ASP2 immunospecif ic antibodies are used to treat conditions requiring 

CC increased or decreased activity or expression of ASP2 respectively. 

CC ASP2 is used to treat and diagnose e.g. Alzheimer's disease, cancer 

CC and prohormone processing and ASP2 or a fragment can be used to induce 

CC an immune response against the above conditions. 
XX 

SQ Sequence 2541 BP; 598 A; 673 C; 675 G; 579 T; 16 other; 

Query Match 98.2%; Score 2032; DB 19; Length 2541; 

Best Local Similarity 99.5%; Pred. No. 0; 

Matches 2049; Conservative 0; Mismatches 10; Indels 1; Gaps 1; 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I | | | | I I I M I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I 1 I I I I I I I I M I I I I I I I 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I II I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 
I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

GTGGAGATGGTGGACAAC CT GAGGGGCAAGT C GGGGCAGGGCTACTACGT GGAGAT GACC 2 4 0 

I | I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

GT GGGCAG CC C C C C GCAGAC GCT CAACAT C CT GGT GGAT ACAGG C AGCAGTAACT T T GC A 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I I 
GT GGG C AGCC C C C C GCAGAC GCT CAACAT C CT GGT GGAT ACAG GC AG CAGT AACT T T GCA 300 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TACCGGGACCTCCGGAAGGGTGTGTATGAGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 8 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 8 0 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

GCT GC C AT CACT GAAT CAGACAAGT T CT T CAT CAAC G GCT C CAACT GG GAAGGC AT C C T G 54 0 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I | I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 660 


Qy 

1 

Db 

1 

Qy 

61 

Db 

61 

Qy 

121 

Db 

121 

Qy 

181 

Db 

181 

Qy 

241 

Db 

241 

Qy 

301 

Db 

301 

Qy 

361 

Db 

361 

Qy 

421 

Db 

421 

Qy 

481 

Db 

481 

Qy 

541 

Db 

541 

Qy 

601 

Db 

601 


Qy 

661 

C C C CT CAAC CAGT CT GAAGT GCT GGC C T CT GT C GGAGGGAGCAT GAT CATT G GAG GT AT C 

720 


I I I I I I I I 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 


Db 

661 

CCCCTCAACLALI L 1 LAAL1 LLI (jbLLl LI LI LLLALbLrAbrUAI IxAI L,A1 I bb/ibbini ^ 

720 

Qy 

721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 

780 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

721 

GACCACTCGCTGTACACALLLAL I L I L I LL iAIALALtLAl LLLLLLLLAL 1 l a /\i 

7 80 

Qy 

781 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 

840 


Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 


Db 

781 

GAGGTGATCATTGTGCGGGTGGAGATCAATGGALALLAI Li LAAAAI LLALI LLAAbrijAbr 


Qy 

841 

T ACAACT AT GACAAGAGCAT T GT GGAC AGT GGC AC C AC CAAC CT T C GT T T GC C CAAGAAA 

900 


1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

841 

TACAACTATGACAAGAGCATTGTGGACAGIGLLALLALLAALLI ILL 1 1 ILLLLAALAAA 

z? U U 

Qy 

901 

GT GTT T GAAGCT GC AGT CAAAT C CAT CAAG GCAGC CT C CT C C AC GGAGAAGT T CC CT GAT 

960 


1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

901 

GT GTT T GAAGCT GCAGT CAAAT C CAT C AAGG CAGC CT C C 1 L CAL LLALAAL 1 ILLLI LAI 


Qy 

961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 

1020 


I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAALLALLLALLALLLLI I bbAAtAl 1 

i non 

Qy 

1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 

1080 


I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 


Db 

1021 

T T C CCAGT CAT CT CAC T CT AC CTAAT GGGT GAGGT T AC CAAC CAGT CCTT C C GLA I LAL L 

i n o n 

IUDU 

Qy 

1081 

AT C CTT C C G C AGCAAT ACCT GC G GC C AGT GGAAGAT GT G GC C AC GT C C CAAGACGACT GT 

1140 


1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1081 

AT CCTT CCGCAGCAATACCTGCGGCCAGT GGAAGAT GTGGCCAL LI LLLAALALLAL1 LI 


Qy 

1141 

T ACAAGTT T GC CAT CT CAC AGT C AT CCAC GGGC ACT GT TAT GGGAG CT GT TAT CAT GGAG 

1200 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1141 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTArGLLALLI LI I AI LAI LLAL 

IZUU 

Qy 

1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 

1260 


1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 


Db 

1201 

GGCTTCTACGTTGTCTTTGATCGGGCLLLAAAALLAAI ILLLI I I LLI Li LALLLLI ILL, 


Qy 

1261 

CAT GTGC AC GAT GAGT T CAGGAC G GC AG C GGT GGAAG GC CCTTT T GT C AC CT T GGAC AT G 

1320 


1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1261 

CATGTGCACGATGAGTTCAGGALLLLALLLLI LLAALLLLLi I I I LI LAL LI IbbALAlb 

i ^ 9 n 

Qy 

1321 

GAAGACT GT GGCT ACAAC AT T C C ACAGAC AGAT GAGT CAAC C CT CAT GAC CAT AGC C TAT 

1380 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 


Db 

1321 

/—t* 7\ >^Ttj'-*rn/"-'rT-i/-*/" , /~ t rpTv /~*7\ t\ r*T\rnrr\r^r^7\r*7\r^7\r*7\r , 7\ r Vf~*7\r''T , r i 7\ Aprp | T , r i 7\ r rr'Ar'r'7\ r l 1 z\rr'r , T2i r P 
GAAGACT GT GGCT ACAAC A I I LLALALALALAI LAL I LAALLL I LAI bALLAlAbbLlAl 

IjOU 

Qy 

1381 

GTCATGGCTGCCATCTGCLLLLI LI I LAI LLI LLLAL I L I LLLI LAI LLI LI LI bAbl Lrbr 

14 4 0 
x *± yj 


| | | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 

1440 

Qy 

1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 

1500 



I | | | i | | | 1 1 1 1 1 1 ! 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 


Db 

1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 

1500 


Qy 


1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCT-GGACCACACCTCCGTGGTTC 1559 


Db 1501 AAGT GAGGAGG C CCAT GGGAGAAAGAT AGAGAT T C CC CT GG GAC CAC AC CT C C GT GGT T C 1560 

Qy 1560 ACTTT GGTCACAAGTAGGAGACACAGAT GGCACCTGT GGCCAGAGCACCT CAGGAC C CT C 1619 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I I 
Db 1561 ACT T T GGT C ACAAGT AGGAGAC AC AGAT GGC AC CT GT G GC C AGAGC AC CT CAGGAC C CT C 1620 

Qy 1620 CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCA 1679 

I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1621 CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCA 1680 

Qy 1680 GGGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGC ACT CT GCT GGC GG GAAT AC 1739 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 1 I I I I I I I I I I I I I I I I 
Db 1681 GGGACT GTACCT GTAGGAAACAGAAAAGAGAAGAAAGAAGCACT CTGCT GGCGGGAAT AC 1740 

Qy 1740 TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACC 17 99 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I 
Db 1741 T CT T GGT CAC CT CAAATTTAAGTCGGGAAATT CTGCT GCT TGAAACTTCAGCCCTGAACC 1800 

Qy 1800 TTTGTCCACCATTCCTTT7VAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 1859 

I I I I I I I I I I I I I I i I I I I I I I I I I I II I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I 
Db 1801 TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 18 60 

Qy 18 60 AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAA 1919 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1861 AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCGGGCAGAGAA 192 0 

Qy 1920 GAGAC CAAGCT T GT TT C C CT GCT GGCCAAAGT C AGTAGGAGAG GAT GCAC AGT T T GCT AT 197 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I II I I I I I I I I I 
Db 1921 GAGAC CAAGCT TGTTTCCCTGCT GGCCAAAGT CAGT AGGAGAGGAT GCACAGT T T GC TAT 198 0 

Qy 1980 TT GCTT T AGAGACAGGGACT GT AT AAACAAGC CT AACATT GGT GCAAAGATT GCCT CTT G 2 039 

I II I I I I I I I I I II I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1981 TTGCTTTAGAGACAGGGACTGT AT AAACAAGCCT AACATT GGTGCAAAGATT GCCT CTT G 2 04 0 

Qy 2040 AATTAAAAAAAAAAAAAAAA 2059 

I I I I I I I I I I I I I I I I I 
Db 2041 AAT T AAAAAAAAAAACT AGA 2060 
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XX 

PI Tang JJN, Hong L, Ghosh AK; 
XX 
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DR P-PSDB; AAB66572 . 
XX 

PT Novel memapsin 2 inhibitors which bind to active site of memapsin 2 

PT having 2 catalytic aspartic residues and substrate binding cleft, used 

PT to treat Alzheimer's disease by blocking amyloid precursor protein 

PT cleavage 
XX 

PS Example 1; Page 70-71; 86pp; English. 
XX 

CC The present sequence is given in a specification relating to an inhibitor 

CC of catalytically active memapsin 2. The inhibitor binds to the memapsin 2 

CC active site, which is defined by the presence of two catalytic aspartic 

CC residues and a substrate binding cleft. The inhibitor is useful for 

CC the treatment and diagnosis of Alzheimer's disease. It is useful in 

CC screens for individuals with a genetic predisposition to Alzheimer's 

CC disease. The inhibitor is useful as a reagent for specifically binding to 

CC memapsin 2 or memapsin 2 analogues and for aiding in memapsin 2 

CC isolation, purification and characterisation. 

XX 

SQ Sequence 3252 BP; 804 A; 863 C; 811 G; 771 T; 3 other; 


Query Match 97.1%; Score 2010; DB 22; Length 3252; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2013; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 

Qy 40 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 99 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 60 

Qy 100 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 159 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I M I I I I I I II 
Db 61 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 120 

Qy 160 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 219 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 121 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 180 


Qy 220 GGC T AC T AC GT G GAGAT GACC GT G GGC AGC C C C CC GCAGACG C T CAAC AT C CT G GT GGAT 279 

I | | | I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I 
Db 181 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 240 


280 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 339 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
241 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 300 

34 0 CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 399 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

301 CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 360 

400 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 459 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I II I I I II I I I I I I I I 

361 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 420 

4 60 GT CAC TGT GC GT GC CAACAT T GCT GC C AT CACT GAAT CAGACAAGT T CTT C AT CAAC GGC 519 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 GT C ACT GT GC GT GC CAACAT T GCT GC CAT CACT GAAT CAGACAAGT T CTT CAT CAAC GGC 480 

520 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 579 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

4 81 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 54 0 

5 80 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 639 

I | | I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

541 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 600 

64 0 CACCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 699 

II | | I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

601 CAGCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 660 

7 00 AG CAT GAT CAT T GGAGGT AT C GAC CACT C GCT GT ACAC AGGCAGT CT CT GGT AT ACAC C C 759 

I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

661 AGCAT GAT CATT GGAGGT AT C GAC CACT CGCTGTACACAGGCAGT CT CTGGTATACACCC 720 

760 AT C C GGCG G GAGT GGT ATT AT GAGGT CAT CATT GT G C GGGT G GAGAT CAAT G GACAGGAT 819 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I 
721 AT CCGGCGGGAGTGGTATT AT GAGGTGAT CATT GTGCGGGT GGAGAT CAATGGACAGGAT 78 0 

820 CT GAAAAT GGACT GCAAGGAGT ACAACT AT GACAAGAGCAT T GT G GAC AGT GGCAC CAC C 879 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 81 CT GAAAAT GGACT GCAAGGAGT ACAACT AT GACAAGAGCATT GT G GAC AGTGGC AC CAC C 840 

880 AAC CT T CGT T T GCC CAAGAAAGT GT TT GAAGCT GC AGTCAAAT C CAT CAAGGC AGC CT C C 939 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
841 AAC CT T CGT T T GCC CAAGAAAGT GT TT GAAGCT GC AGTCAAAT C CAT CAAGGCAGC CT C C 900 

940 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 999 

I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I II I I I I I I I I II I II I I I I I I I I I I I 

901 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 960 

1000 GG CAC CAC C C CT T G GAACAT T T T C C C AGT CATCT C ACT CT AC CT AAT G GGT GAGGT T AC C 1059 
I | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
9 61 GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 1020 

1060 AAC CAGT C CT T C C GC AT CAC CAT C CTT C C GCAGCAATACCT GC GGC C AGT GGAAGAT GT G 1119 

I | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 AAC CAGT C CTT C C GC AT CAC CAT C CTT C C GCAGCAATACCT GC GGCCAGT GGAAGAT GT G 1080 

1120 GC C AC GT C C CAAGAC GACT GT T ACAAGTT T GC C AT CT CAC AGT CAT C C AC GGGCACT GT T 1179 


1081 G C C AC GT C CCAAGACGACT GT T ACAAGT T T GC CAT C T CACAGT CAT C CAC G GGCACT GT T 1140 

118 0 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1239 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I 
1141 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1200 

124 0 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 1299 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1201 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 1260 

1300 C CT T TTGT C AC CTT GGAC AT GGAAGACT GT G GCT ACAAC AT T C C ACAGAC AGAT GAGT C A 1359 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1261 C CT T T T GT CAC CTT GGAC AT GGAAGAC T GT G GCT ACAAC AT T C C AC AGAC AGAT GAGT C A 1320 

1360 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1419 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II 
1321 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1380 

1420 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 1479 

I | | | | | | II I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 

1381 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 14 40 

1480 GCT GAT GAC AT CTCCCTGCT GAAGT GAGGAG GC C CAT GGGCAGAAGAT AGAGATT CC C CT 1539 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M 
1441 GCT GAT GACAT CT C C CT GC T GAAGT GAG GAGGC C CAT GGGCAGAAGAT AGAGATT C C C CT 1500 

1540 GGAC C ACAC C T C C GT GGT T CACT TT GGT C AC AAGT AGGAGAC ACAGAT GGCAC C T GT G GC 1599 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
1501 GGAC C ACAC CT C C GT GGT T CACT T T GGT CAC AAGT AG GAGACACAGAT GGCAC CT GT GGC 15 60 

1600 C AGAG CACC T C AG GACCCT C C C C AC CCAC CAAAT GC CT CT GC CT TGAT GGAGAAGGAAAA 1659 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
1561 C AGAGC AC CT CAGGAC CCT C C C CAC CCAC CAAAT GCCT CT GC CT T GAT G GAGAAGGAAAA 1620 

1660 GG CT GGCAAGGT GGGTT C CAGGGAC T GT AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAG 1719 

I I I I I I M I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I H I I I I I I I I I II I 
1621 GGCT GG CAAGGT GGGTT C CAGGGACTGT AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAG 168 0 

1720 CACT CT GCT GGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCT GCT GCT 1779 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
1681 CACT CT GCT GGCGGGAATACT CTT GGTCACCTCAAATTTAAGTCGGGAAATTCT GCT GCT 17 40 

1780 TGAAACTTCAGCCCTGAACCTTTGT CCAC CATTCCTTTAAATTCTCCAACCCAAAGT ATT 1839 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1741 TGAAACTTCAGCCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCTW^GTATT 18 00 

1840 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 99 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I 
1801 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 60 

1900 T GT GGT ACC CT GGC AGAGAAGAGAC CAAGCT T GT TTCCCTGCTGGC CAAAGT C AGT AGGA 1959 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1861 TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1920 

1960 GAGGAT GCAC AGT T T G CT ATT T GCT T T AGAGACAGGGACT GT ATAAACAAGC CTAACATT 2 019 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 


Db 1921 GAGGAT G C AC AGTT T GC T AT T T GCTT T AGAGAC AGGGACT GTAT AAACAAG C C T AAC AT T 1980 

Qy 2020 GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAAAAAAA 2057 

I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I 

Db 1981 GGT GCAAAGAT TGCCTCTT GAATT AAAAAAAAACTAGA 2 018 

RESULT 13 
AAF28101 

ID AAF28101 standard; DNA; 3252 BP. 
XX 

AC AAF28101; 
XX 

DT 02-APR-2001 (first entry) 
XX 

DE Memapsin 2 DNA. 
XX 

KW Memapsin 2; catalyst; Alzheimer's; ds . 
XX 

OS Homo sapiens . 
XX 

PN WO200100663-A2 . 
XX 

PD 04-JAN-2001. 
XX 

PF 27-JUN-2000; 2000WO-US17 661 . 
XX 

PR 28-JUN-1999; 99US-0141363 . 

PR 30-NOV-1999; 99US-0168060 . 

PR 25-JAN-2000; 2000US-0177836 . 

PR 27-JAN-2000; 2000US-0178368 . 

PR 08-JUN-2000; 2000US-0210292 . 
XX 

PA (OKLA-) OKLAHOMA MEDICAL RES FOUND. 
XX 

PI Tang JJN, Lin X, Koelsch G; 
XX 

DR WPI; 2001-102885/11. 
XX 

PT Purified recombinant catalytically active memapsin 2, used to screen 

PT inhibitors of it f which are used to treat and prevent Alzheimer's 

PT disease - 
XX 

PS Example 1; Page 71-72; 86pp; English. 
XX 

CC The present invention relates to a purified recombinant 

CC catalytically active memapsin 2. The invention may be used for 

CC isolating inhibitors which are used to treat or prevent 

CC Alzheimer's disease. The invention may also be used to screen 

CC for individuals more genetically prone to develop Alzheimer's 

CC disease. 

XX 

SQ Sequence 3252 BP; 804 A; 863 C; 811 G; 771 T; 3 other; 


Query Match 97.1%; Score 2010; DB 22; 

Best Local Similarity 99.8%; Pred. No. 0; 
Matches 2 013; Conservative 0; Mismatches 5; 


Length 3252; 
Indels 0; Gaps 


0; 


Qy 

40 

GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 

99 


i i i i i i i i i t i t i i i i i i i i t i i i t i i i i i i i i i i i i i i i i i i i i [] I t t I l l l 1 1 1 1 I 1 
1 1 j 1 1 1 1 1 1! 1 II 1 1 1 It 1 II 1 I 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 II II II 1 IE I 1 1 1 1 1 1 1 1 l I 


Db 

1 

GCGGGAGTGCTGCC1 GCCCACGGCALCCAGCACGGCA1 UL-Ij^L-I ijOUL-d LjUUwVj^^^^-. 

60 

Qy 

100 

CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 

159 


i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 
1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 


Db 

61 

CTGGGGGGCGCGCCCC1 GGGGC1 GL.GGC1 GCUUC(j(jbAtjAUC(ji\Ukj/\A^^ 

12 0 

Qy 

160 

CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 

219 


i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i I t i t t > i i i i i t i 1 1 1 
1 I I I II 1 1 1 II 1 ! 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 I 


Db 

121 

CCCGGCCGGAGGGGCAGCTTTGrGGAGAI GG1 GGACAACC 1 GAbjbtaljUAAUl Ul^UUUAbj 

1 ft D 

Qy 

220 

GGCT ACT AC GT G GAGAT GAC C GT GGGC AG C C C CC C G C AGACGCT CAAC AT C CT GGT G GAT 

279 


i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i l 
I | | | M I I II 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

181 

GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACAI CC 1 GG1 GGAI 

o a n 

Z, ft u 

Qy 

280 

ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 

339 


i i i i i i i i i i i i i i i i i i i i i i i i i i i i i > i i i i i i i i i i i i i i i i i i i i i i i i 

1 M 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

241 

ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCA1 CGC1AC1AC 

JUU 

Qy 

340 

CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 

399 


i i i i * i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t 
1 I I II I I I I 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

301 

CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGrbl AI Gl GCCC1 ACACC 


Qy 

400 

CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 

459 


■ i i i i i t i i i i i i i i i i i i i i i r i i t I i I 1 i 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

361 

CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCAI CCCCCAI GGCCCCAAU 


Qy 

460 

GTCACTGTGCGTGCCAACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGC 

519 


1 1 1 1 1 1 1 1 1 « 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 
I | | | 1 1 1 1 1 II M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

421 

GTCACTGTGCGTGCCAACATTGCTGCCATCACTGAATCAGACAAG1 1 C 1 1 CA1 CAACGGC 

/ion 
H O U 

Qy 

520 

TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 

579 


i t i i i i i i i i t i i i i i i i i i i i i t t i i i t i i i i i i i i i t i i i i i i i i i i i i i t i l l l i l 1 

M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 M 1 1 1 1 1 1! 1 1 1 1 1 1 1 1 1 


Db 

481 

TCCAACTGGGAAGGCATCCTGGGGCTGGCCI A1GC1 GAGA1 1 (^UUACj(jOL.itjAUvj/\L.l 

3 4 U 

Qy 

580 

CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 

639 


i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i t 

1 1 1 j 1 1 1 1 i 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

541 

CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACG1 1 CCCAAL-Ul L-l itltLLib 

D U U 

Qy 

640 

CACCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 

699 


II i i i i i i i i i 1 1 1 1 | 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 I 1 I 


Db 

601 

CAGCTTTGTGGTGCTGGC1 1 CCCCC1 CAACCAG1 LI CjAAbl btl LrtjL-L,l c lbiLbbAbbb 

D D U 

Qy 

700 

AGC AT GAT CAT T GGAG GT AT C GAC C ACT C GC T GT AC AC AGGCAGT CT CT G GT AT AC AC CC 

759 


i i i i i i i i i i i j i i i i i i i i i i i i i i i i i i i i i i I I I I I I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M II M 


Db 

661 

AGCATGATCAT1 GGAGG1A1 LGACCAL.1 Lbbl tal AUAUAUtjUAtjl bibl bbl/\i/\L/ibbb 

17 0 

Qy 

760 

AT C C GGC GGGAGT GGT ATT AT GAGGT CAT C ATT GT GC GGGT GGAGAT C AAT GGAC AGGAT 

819 


1 I I I 1 1 1 1 1 II ! 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 > 1 1 1 1 1 1 1 1 > 1 1 


Db 

721 

AT C C GGC GGGAGT GGT ATT AT GAG GT GAT CAT T GTGC GGGT GGAGAT C AAT GGAC AGGAT 

780 

Qy 

820 

CTGAAAAT GGACT GCAAGGAGTACAACTAT GACAAGAGCATTGT GGACAGTGGCACCACC 

879 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 M 


Db 

781 

CT GAAAAT GGACT GCAAGGAGTACAACTAT GAC AAGAGC ATT GT GGAC AGT GGC AC CAC C 

840 


Qy 

880 

AACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCC 

939 


1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 I 1 1 1 1 1 1 


Db 

841 

AACCTTCGTT I GCOOAAGAAACjI Gl 1 1 bAAbL 1 btAbl t/wil LAAbbUMbL.^ l 

900 

Qy 

940 

TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 

999 


I 1 1 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 


Db 

901 

TCCACGGAGAAGTTCCCTGATGGT I TCI GGC1 AGGAGAGCAGC 1 GG1 Gl bb 1 brbrUAAb^A 

q fin 

Z> U \J 

Qy 

1000 

GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 

1059 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 


Db 

961 

GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGI GAGbl 1ALL 

1 H9H 
1UZU 

Qy 

1060 

AACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTG 

1119 


I I M | 1 I 1 1 1 1 I 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 


Db 

1021 

AACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAG1 GGAAGA1 Gl br 

i n p n 

1UDU 

Qy 

1120 

GC CACGTCCCAAGACGACT GTTACAAGTTTGCCAT CT CACAGT CAT CCACGGGCACT GTT 

1179 


I I I I I I I I M 1 1 1 II 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 


Db 

1081 

GCCACGTCC CAAGAC GACT GTTACAAGTTTGCCAT CT CACAGT CAT CCACGGGCACT GTT 


Qv 

1180 

ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 

1239 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II I 1 II 1 1 1 1 1 1 1 1 1 1 


Db 

1141 

AT GGGAGCT GT TAT CAT GGAGGG CT T CT ACGT T GT CTT T GAT C G GG C C C GAAAAC brAAl I 

i 9 n n 

Qy 

1240 

GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 

1299 


i M i | 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 M I 1 1 M 1 1 1 M 1 1 M 


Db 

1201 

GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 

_LZ DU 

Qy 

1300 

C CT T T T GT C AC CT T GGAC AT G GAAGACT GT GGCT ACAAC AT T C CAC AGACAGAT GAGT C A 

1359 


I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1261 

CCTTTT GTCACCTTGGACAT GGAAGACT GTGGCTACAACATT CCACAGACAGA1 GAG I LA 


Qy 

1360 

ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 

1419 


I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 


Db 

1321 

ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCI I CA1GC1GCCAC1C 

1 OpA 

looU 

Qy 

1420 

TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 

1479 


I I I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1381 

TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCAICirAlbrAL.1 1 1 

i a a n 

± ^ H U 

Qy 

1480 

GCT GAT GACATCTC CCT GCT GAAGT GAGGAGGCCCAT GGGCAGAAGAT AGAGATTCCCCT 

1539 


1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1441 

GCTGATGACATCTCCCTGCTGAAG1 GAGGAGGCCCA1 Cjbr(jCAbr/\/\b»/\l/\br/\br/\i i bbubi 

x j u \j 

Qy 

1540 

GGAC CAC AC CT CCGT GGTT C ACT T T GGT C AC AAGT AG GAGAC ACAGAT GGCAC CT GT G GC 

1599 


I | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 


Db 

1501 

GGAC CACACCTCCGT GGTT CACTTT GGT CACAAGi AbrCjAbrACAL-AUAl bibrL.AL.L.1 bri t^LrU 

1 

1JDU 

Qy 

1600 

CAGAGCACCTCAGGACCCTCCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAA 

1659 


1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 


Uu 

IJOl 

rAf^anrArrTrA^ArcCTCCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAA 

1620 

Qy 

1660 

GGCT GGCAAGGT GGGTTC CAGGGACT GTACCT GT AG G AAAC AGAAAAG AG AAG AAAGAAG 

1719 


1 I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1621 

GGCT GGCAAGGT GGGTT C CAGGGACT GT AC C T GT AGGAAAC AGAAAAGAGAAGAAAGAAG 

1680 

Qy 

1720 

CACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCT 

1779 


Db 

1681 

Qy 

1780 

Db 

1741 

Qy 

1840 

Db 

1801 

Qy 

1900 

Db 

1861 

Qy 

1960 

Db 

1921 

Qy 

2020 

Db 

1981 


I I I I I I I I I I I I I I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I 

CACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGG7VAATTCTGCTGCT 174 0 

TGAAACTTCAGCCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATT 1839 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I 

TGA7\ACTTCAGCCCTGAACCTTTGTCCACCATTCCTTT7\AATTCTCCAACCCAAAGTATT 18 00 

CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 99 

I I M | I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 60 

T GT GGT AC C CT GGC AGAGAAGAGAC CAAGCT T GTTTCCCTGCTGGC CAAAGT CAGT AGGA 1959 

I | | | I II I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

T GT G GT AC C CT GGC AGAGAAGAGAC CAAGC T T GT TT C C CT GCTG GC CAAAGT C AGTAGGA 192 0 

GAGGAT GCACAGTT T GCT AT T T GCT T TAGAGAC AGGGACT GTAT AAACAAGC CTAACAT T 2 019 

II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I II I I II I I II I I I i I I I I 

GAGGAT GCACAGT T T GC T ATTT G CT T T AGAGACAGGGACT GT AT AAACAAG C CTAACAT T 198 0 
GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAAAAAAA 2 057 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

GGT GCAAAGATT GC CT CT T GAAT T AAAAAAAAACTAGA 2 018 


RESULT 14 

ABK91244 

ID 

ABK91244 standard; cDNA; 3252 BP. 

XX 


AC 

ABK91244; 

XX 


DT 

15-NOV-2002 (first entry) 

XX 


DE 

Human cDNA encoding Memapsin 2. 

XX 


KW 

Human; ss; gene; memapsin 2; aspartic protease; beta secretase; 

KW 

degenerative disease; Alzheimer's disease; amyloid precursor protein; 

KW 

APP; neuroprotective; nootropic; inhibitor; 

KW 

substrate side-chain preference. 

XX 


OS 

Homo sapiens. 

XX 


PN 

WO200253594-A2. 

XX 


PD 

ll-JUL-2002. 

XX 


PF 

28-DEC-2001; 20 01WO-US5082 6 . 

XX 


PR 

28-DEC-2000; 2000US-258705P . 

PR 

14-MAR-2001; 2 001US-275756P . 

XX 


PA 

( OKLA- ) OKLAHOMA MEDICAL RES FOUND. 

PA 

(UNII ) UNIV ILLINOIS FOUND. 

XX 


PI 

Tang JJN, Koelsch G, Ghosh AK; 

XX 


DR 

WPI; 2002-619088/66. 

DR 

P-PSDB; ABG78372, ABG78374. 


XX 

PT New memapsin 2 activity inhibitor useful in treatment of e.g. 

PT Alzheimer's disease 

XX 

PS Disclosure; Fig 6; 74pp; English. 
XX 

CC The invention relates to an inhibitor of catalytically active memapsin 2 

CC (an aspartic protease which can cleave at beta secretase sites), which 

CC binds to the active site of memapsin 2 defined by the presence of two 

CC catalytic aspartic residues and substrate binding cleft. Also 

CC included is a method of determination of the substrate side-chain 

CC preference in memapsin 2 sub-sites comprising: (a) reacting a mixture of 

CC memapsin 2 substrates with memapsin 2, and determining the sub-site 

CC preference of memapsin 2 by determining relative initial hydrolysis rates 

CC of the mixture of memapsin 2 substrates; or (b) preparing a combinatorial 

CC library of memapsin 2 inhibitors containing a base sequence taken from 

CC OM99-2 (Glu-Val-Asn-Leu-Ala-Ala-Glu-phe) , probing the library of 

CC inhibitors with memapsin 2 which binds to several inhibitors to generate 

CC several bound memapsin 2, and detecting the bound memapsin 2 with an 

CC antibody raised to memapsin 2 and an alkaline phosphatase conjugated 

CC secondary antibody. The inhibitors may be used in the manufacture of a 

CC medicament for the treatment of Alzheimer's disease since memapsin 2 may 

CC be involved in the cleavage of amyloid precursor protein (APP) , and for 

CC determining the substrate side-chain preference in memapsin 2 sub-sites. 

CC The present sequence encodes human memapsin 2. 

XX 

SQ Sequence 3252 BP; 804 A; 863 C; 811 G; 771 T; 3 other; 

Query Match 97.1%; Score 2010; DB 24; Length 3252; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2013; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 


Qy 


Db 


4 0 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 99 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

1 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 60 


Qy 


Db 


100 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 159 
I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
61 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 120 


Qy 


Db 


160 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 219 

I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I M 
121 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 180 


Qy 


Db 


220 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 27 9 

I | | | | | I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

181 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 2 40 


Qy 


Db 


280 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 339 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I I I ! M 

241 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 300 


Qy 


Db 


34 0 C AGAGG C AGCT GT C CAGC AC AT AC C GGGAC CT C C GGAAGGGT GT GT AT GT GCC CT AC AC C 399 

I | | | | | | || I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

301 CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 360 


Qy 


4 00 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 4 59 


Db 

3ol 

CAGGGLAAG1 GGGAAbbxbrbrAbL. 1 bbbL-.H.CCb.ttC.v^l bb ±/V\bW-vl ^ L- ^> \^t\ x uu^w^^nnv, 

420 

Qy 

460 

GT C ACT GT GC GT GCCAACAT T GCT GC C AT CACT GAAT CAGACAAGT T CTT CAT CAAC GGC 

519 


i i i i i i i i i i i i i i i i t i i i i i i i i i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 
I I I I I I ] I I M 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 M M M M M M 1 1 1 I 1 1 I I 1 1 1 1 1 1 1 1 l l M 


Db 

421 

GTCACTGTGCGTGCCAACA1 1 bb 1 bLLAl CAb 1 bAAl b/\b/\b/\/\b llLil b/\i L,/\zv^*cr^^ 

480 

Qy 

520 

TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 

579 


i i i i i i i i i i i i i i i i i i i i i i i i i i l 1 i l l l 1 i 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 
1! 1 1 1 1 1 II II 1 1 M I 1 1 1 1 II I 1 1 1 M 1 1 1 1 1 I II 1 1 1 II II II II It II 1 1 1 1 1 1 1 1 I 


Db 

481 

TCCAACTGGGAAGGCATCCTGGGGL 1 bbLLlAl bbl bAbAl 1 bbbAbbbb 1 bAbb/\b j. b^ 

540 

Qy 

580 

CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 

639 


i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 I I || 1 1 II 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 I II II 1 1 1 1 1 1 I 1 II 1 1 1 1 1 


Db 

541 

CTGGAGCCTTTCTTTGACTCTCTGGTAAAGbAbAbLbAbbI 1LLLAALL1L1 iLlLLbib 

DUU 

Qy 

640 

CACCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 

699 


t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i l i i i i i i i i i i i i i i i i 1 1 1 1 1 

II I I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 > 1 1 1 1 * 1 1 1 M I 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

601 

CAGCTTTGTGGTGCTGGCTTCCCCbl bAALLAbl LI bAAbl bbl bbbbl CI bl bbb/\bbb 

o u u 

Qy 

700 

AGC AT GAT CAT T GGAGGT AT CGACCACT CGCT GT AC ACAG GC AGT CT CT GGT AT AC AC C C 

759 


I 1 I 1 ( 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 ] 1 1 1 1 1 1 I 1 1 1 1 j 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 I 1 1 

M 1 1 1 1 1 1 1 1 1 i 1 1 1 E 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

661 

AGCAT GAT CATT GGAGGTAT CGACbAb 1 bGL Ibl AbAbAbbbAb 1 b 1 b 1 bb 1 AI alallo 

7 o n 

Qy 

760 

AT C CGGC GGGAGT GGT AT TAT GAGGT CATC ATT GT GC GG GT G GAGAT CAAT G GAC AGGAT 

819 


■ i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i 
1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 


Db 

721 

AT CCGGC GGGAGT GGT ATTATGAGG1 GA1 bAl 1 b 1 Gbbbbl bbAbAl LAAl bbAbAbbAi 

7 R D 

Qy 

820 

CTGAAAATGGACTGCAAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACC 

879 


i i i i i i i i i i i i i i i i i i i i t t i i i i i i i i i i i i i i i i i i i i i i i i j i i j 1 i j 

I || || II 1 II 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 


Db 

781 

CTGAAAATGGACTGCAAGGAGTACAAb 1 Al GAbAAbAbbAl 1 bl bbALAb l bbbAbLALL 

R A 0 
o *i u 

Qy 

880 

AACCTTCGTTTGCCCAAGAAAGTGTTTG7VAGCTGCAGTCAAATCCATC7\AGGCAGCCTCC 

939 


i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i t i i i i i 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 
I || II I II I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 


Db 

841 

AACCTTCGTTTGCCCAAGAAAGTGTTTGAAGC I bbAbl bAAAl bbAl LAAbbbAbbb ibL 

_7 \J U 

Qy 

940 

TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 

999 



i i i i i i i i i i i i i i i i i i t i i i i i i i t i i i i i i i i i i i i i i i i i i i 1 1 1 1 1 t 1 I 1 1 1 1 1 1 
1 I I || 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II II 1 M 1 1 1 1 II II 


Db 

901 

TCCACGGAGAAGTTCCC1 GAI GG1 1 1 C 1 GGC I AbbAbAbbAbb 1 bbi bibbibbLAab^A 

3? U \J 

Qy 

1000 

GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 

1059 


i i i i i i t t t t i i i i i i i i i i i i i i i i i i i i i i I I I I I I I I I I I l l l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

961 

GGCACCACCCCTTGGAACA1 1 I IbbbAblbAlblbAblblAbblAAl bbblbAbbl 

1 020 

Qy 

1060 

AAC CAGT C CTT C C GC AT CACCAT C CT T C C GCAGCAAT AC CT GCGGC CAGT GGAAGAT GT G 

1119 


i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i I I I I I I I I I 1 1 1 1 1 1 1 1 1 1 1 
1 I I I 1 II 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 


Db 

1021 

AACCAGTCCTTCCGCA1 CACCA1 bbl 1 bbbbAbbAAlAbbl bbbbbb/\bi \j\jjai\\ji\± 

1080 

Qy 

1120 

GCCACGTCCCAAGACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTT 

1179 


1 1 1 It 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 11 1 I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i j i i i i i i i i i i ■ i i i i 1 ■ ■ f ■ i i i i i 


Db 

1081 

GC C ACGT C C CAAGAC GACT GT T ACAAGT T T GC C AT CT CAC AGT CAT C C AC GGG CACT GT T 

1140 

Qy 

1180 

ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 

1239 


I I I I 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 


Db 

1141 

ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 

1200 

Qy 

1240 

GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 

1299 


I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I 


Db 

1201 

Qv 

1300 

Db 

1261 

Ov 

1360 

Db 

1321 

Ov 

1420 

Db 

1381 

Ov 

1480 

Db 

1441 

Ov 

1540 

Db 

1501 

Ov 

1600 

Db 

1561 

Ov 

1660 

Db 

1621 

Ov 

1720 

Db 

1681 

Ov 

1780 

Db 

1741 

Ov 

1840 

Db 

1801 

Ov 

1900 

Db 

1861 

Qy 

1960 

Db 

1921 

Qy 

2020 

Db 

1981 


GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 1260 

C CT T TT GT CAC CT T GGAC AT G GAAGACT GT GGCT ACAACATT C C AC AGACAGAT GAGT C A 1359 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

C CT TT T GT CAC CT T GGAC AT GGAAGACT GT GGCTAC AACAT T C CAC AGACAGAT GAGT C A 1320 

ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1419 

I I I I II I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I M I II I I I I I I I I I I 

ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 138 0 

TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 1479 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 144 0 

GCT GATGACAT CT CCCT GCT GAAGTGAGGAGGCCCAT GGGCAGAAGATAGAGATT CCCCT 1539 

I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

GCT GAT GAC AT CT CC CT GCT GAAGT GAGGAGGC CCAT GGGC AGAAGAT AGAGATT C CCCT 1500 

GGACC AC ACCT C C GT GGTT CACT TT G GT CACAAGT AG GAGAC ACAGAT GGC ACCT GT GGC 1599 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I II I I I I I I I I I 

GGAC CAC ACCT C CGT GGTTCACTTT GGT CACAAGT AGGAGACACAGAT GGCACCTGT GGC 1560 
C AGAGCAC CT C AGGAC C CT C CC C AC C CAC CAAAT G C CT CT G C CTT GAT GGAGAAGGAAAA 1659 

I I I M I I I I I I I I I I I M I E I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CAGAG CAC CT CAGGAC C CT C C C CAC C CAC CAAAT GC CT CT GC C TT GAT GGAGAAGGAAAA 1 62 0 
GGCTGGCAAGGT GGGTT CCAGGGACT GTACCT GTAGGAAACAGAAAAGAGAAGAAAGAAG 1719 

I I I I I I I I I I II I I I I I I I M I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

G GCT G G CAAGGT GGGTT C C AGGGACT GT AC CT GTAGGAAACAGAAAAGAGAAGAAAGAAG 168 0 
CACT CT GCT GGC GGGAATACT CTT GGT CAC CTCAAATTTAAGTCGGGAAATTCT GCT GCT 177 9 

M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I 

CACT CTGCT GGC GGGAATACT CTT GGT CACCTCAAATTTAAGTCGGGAAATTCT GCT GCT 17 4 0 
T GAAACT T CAGC C CT GAAC CT TT GT C CAC CAT T C CT TTAAAT T CT C CAACC CAAAGT AT T 1839 

M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I 

X GAAACT T CAGC C CT GAAC CT T T GTC CAC CAT T C CT T T AAATT CTC CAACC CAAAGT AT T 1800 

CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 99 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I E I I I I 

CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 1860 

TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1959 

I | | | | | || I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I II I I 

TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1920 

GAGGATGCACAGTTTGCTATTTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATT 2 019 

i I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

GAGGAT G CACAGT T T GCT ATTT GCT T TAGAGAC AGGGACT GT AT AAACAAGC CT AACAT T 1980 

GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAAAAAAA 2057 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAACTAGA 2018 


RESULT 15 
ABK88641 


ID ABK88641 standard; cDNA; 3252 BP. 
XX 

AC ABK88641; 
XX 

DT 07-OCT-2002 (first entry) 
XX 

DE cDNA encoding human memapsin 2. 
XX 

KW Human; memapsin 2; beta secretase; aspartic protease; APP; 

KW beta-amyloid precursor protein; amyloid plaque; Alzheimer's disease; 

KW neuroprotective; nootropic; expressed sequence tag; EST; gene; ss. 

XX 

OS Homo sapiens. 


XX 

FH Key Location/Qualifiers 

FT CDS 1. . 1467 

FT /*tag= a 

FT /partial 

FT /product- "Memapsin 2" 

FT /note= "This sequence lacks a start codon" 

XX 


PN US2002049303-A1. 
XX 

PD 25-APR-2002. 
XX 

PF 28-FEB-2001; 2001US-0796264 . 
XX 

PR 28-JUN-1999; 99US-14 1363P . 

PR 30-NOV-1999; 99US-168060P . 

PR 25-JAN-2000; 2 000US-177 836P . 

PR 27-JAN-2000; 2000US-178368P . 

PR 27-JUN-2000; 2 000US-0604 608 . 
XX 

PA (TANG/) TANG J J N. 

PA (LINX/) LIN X. 

PA (KOEL/) KOELSCH G. 

PA (HONG/) HONG L . 

XX 

PI Tang JJN, Lin X, Koelsch G, Hong L; 
XX 

DR WPI; 2002-507280/54. 

DR P-PSDB; AAU99488. 
XX 

PT New recombinant catalytically active memapsin 2, useful to screen for 

PT inhibitors of memapsin 2 which can be used to prevent and treat 

PT Alzheimer ! s disease 
XX 

PS Example 1; Page 20-21; 44pp; English. 
XX 

CC The present invention relates to methods for the production of 

CC purified, recombinant catalytically active, memapsin 2 (beta 

CC secretase) . Memapsin 2, a member of the aspartic protease family, 

CC cleaves beta-amyloid precursor protein (APP) found in amyloid plaques. 

CC The recombinant memapsin 2 is useful for identifying inhibitors of 

CC memapsin 2 in the design of drugs for the treatment and/or prevention 

CC of Alzheimer's disease. The recombinant memapsin 2 can be used to 

CC immunise against Alzheimer's disease. The present sequence encodes 


CC human memapsin 2 . 
XX 

SQ Sequence 3252 BP; 804 A; 863 C; 811 G; 771 T; 3 other; 

Query Match 97.1%; Score 2010; DB 24; Length 3252; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2013; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 


Ov 

40 

GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 

99 


I I M | I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1 

GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 

60 

Ov 

100 

CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 

159 


I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


LJkJ 

61 

CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 

120 

Ov 

160 

CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 

219 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II E 1 1 1 1 1 1 1 1 1 I 1 1 


Db 

121 

CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 

180 

wy 

220 

GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 

279 


I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

181 

GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 

240 

wy 

280 

ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 

339 


I | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 


Db 

241 

ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 

300 

yy 

34 0 

CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 

399 

Db 

301 

I I I I I 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I 1 ! 1 1 i 1 I 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M II 

CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 

360 

Ov 

400 

CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 

459 


I I I 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

361 

CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 

420 

Ov 

460 

GTCACTGTGCGTGCCAACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATC7\ACGGC 

519 



I | | | | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

421 

GT C ACT GT GC GT GC CAAC AT T G CT GC CAT CAC T GAAT C AGACAAGT T CT T CAT CAAC GGC 

480 

Qv 

520 

TCCAACTGGGAAGGCAT'CCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 

579 


I | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

481 


54 0 

Qy 

580 

CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 

639 


1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 


Db 

541 

CTGGAGCCTTTCTTTGACTCTCTGGTTW^GCAGACCCACGTTCCCAACCTCTTCTCCCTG 

600 

Qy 

640 

CACCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 

699 


II | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 M 


Db 

601 

CAGCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 

660 

Qy 

700 

AGCAT GAT CATT GGAGGTAT CGACCACTCGCT GT ACACAGGCAGT CTCTGGTAT ACACCC 

759 


I I I I I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I 1 


Db 

661 

AGCAT GAT CAT T GGAGGTAT CGAC C ACT C GCT GT AC AC AGG CAGT CT CT GGT AT ACACC C 

720 

Qy 

760 

AT CCG GC GGGAGT G GT AT TAT GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT GGACAGGAT 

819 


Db 

721 

Qy 

820 

Db 

781 

Qy 

880 

Db 

841 

Qy 

940 

Db 

901 

Qy 

1000 

Db 

961 

Qy 

1060 

Db 

1021 

Qy 

1120 

Db 

1081 

Qy 

1180 

Db 

1141 

Qy 

1240 

Db 

1201 

Qy 

1300 

Db 

1261 

Qy 

1360 

Db 

1321 

Qy 

1420 

Db 

1381 

Qy 

1480 

Db 

1441 

Qy 

1540 

Db 

1501 

Qy 

1600 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


CT GAAAAT GGACT GCAAGGAGTACAACTATGACAAGAGCATT GT GGACAGT GGCACCACC 879 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

CTGAAAAT GGACT GCAAGGAGTACAACTAT GACAAGAGCATT GT GGACAGT GGCACCACC 84 0 

AACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCC 939 

I | | | | I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I M I 

AAC CT T C GT T T G C C CAAGAAAGT GTTT GAAGCT GC AGT CAAAT C CAT CAAGGC AGC CT C C 900 

TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 999 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I I 

TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 960 

GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 1059 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 1020 

AAC C AGT C CTT C C GC AT CAC CAT C CT T C CGCAGCAAT AC CT G C GGC CAGT GGAAGAT GT G 1119 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 1 I I II I I I I I I I I I I I I 
AAC CAGT C CT T C C GCAT CAC CAT C CT T C CGCAGCAAT AC CT GC GGC CAGT G GAAGAT GT G 1080 

GC C AC GT C C CAAGAC GACT GT T ACAAGT TT GCCAT CT CACAGT CAT C CAC GGGC ACT GT T 117 9 

|| | | | | | | | I I I I I I I I i I I I 1 I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I 

GC CAC GT C C CAAGAC GACT GT T ACAAGTTTGC CAT CT CACAGT CAT C CAC GGGCACT GT T 1140 

AT G GGAGCT GT T AT CAT G GAG GGCT T CT AC GTT GT CT T T GAT C GGG C C C GAAAAC GAAT T 1239 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I M I 

ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1200 

GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 12 99 

M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I E I I I I I I I I I I I I I I I I I I I I I 

GGCTTTGCTGTCAGCGCTT GCCAT GTGCAC GAT GAGTTCAGGACGGCAGCGGTGGAAGGC 12 60 

CCTTTTGT CACCTT GGACAT GGAAGACT GT GGCT ACAAC ATT C CAC AGACAGATGAGT CA 1359 

| | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I II I II I I 

CCT T T T GT CAC CT T GGACAT GGAAGACT GT GGCT ACAACATT C C AC AGAC AGAT GAGT C A 132 0 

ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1419 

I | | | | | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I II I I I I I II I M I 1 I I 

ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 138 0 

TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 147 9 
I I I I I I I I I I I M 1 I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 144 0 

GCT GAT GAC AT CT C C C T G CT GAAGT GAGGAGGC C CAT GGGCAGAAGAT AGAGAT T C C C CT 1539 

I I M II I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

GCT GAT GACAT CTCCCTGCT GAAGT GAGGAGGC C CAT GGGCAGAAGAT AGAGATT C CC CT 1500 
GGACCACAC CT C C GT GGT T C ACT T T GGT C ACAAGT AGGAGAC AC AGAT G GC AC CT GT GG C 1599 

M | || | | | | | | | | I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I I I I M 

GGACCACACCTCCGTGGTTCACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGC 1560 

C AGAGCACCT CAG GAC C C T C C C CAC C CAC CAAAT G C CT CT GCCTT GAT GGAGAAGGAAAA 1659 
I I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 


Db 


1561 


CAGAGCACCTCAGGACCCTCCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAA 


1620 


Qy 1660 G GCT GGCAAGGT G G GTT C CAGG GACT GT AC C T GT AGGAAAC AGAAAAGAGAAGAAAGAAG 1719 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 1621 GGC T G GCAAG GT GG GTT C CAGGGACT GT ACCT GT AGGAAAC AGAAAAGAGAAGAAAGAAG 1680 

Qy 1720 CACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCT 1779 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

Db 1681 CACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCT 17 40 

Qy 1780 TGAAACTTCAGCCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATT 1839 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 1741 TGAAACTTCAGCCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATT 18 00 

Qy 1840 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 99 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1801 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 60 

Qy 1900 TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1959 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 1861 TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1920 

Qy 1960 GAGGATGCACAGTTT GCTATTT GCTTTAGAGACAGGGACT GT ATAAACAAGCCTAACATT 2 019 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 1921 GAGGAT GCAC AGT T T GCT AT TT GCT TT AGAGACAGGGAC T GT AT AAACAAGC CT AAC AT T 198 0 

Qy 2020 GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAAAAAAA 2057 

I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1981 GGT GCAAAGAT T GC CT CT T GAAT TAAAAAAAAACT AGA 2018 
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RESULT 1 

US-09-548-372D-3 

; Sequence 3, Application US/09548372D 

; Patent No. 6420534 . 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL. 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/62801 

; CURRENT APPLICATION NUMBER: US/09/548, 372D 
; CURRENT FILING DATE: 2000-04-12 
; PRIOR APPLICATION NUMBER: US 60/155,493 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US99/2 08 8 1 
; PRIOR FILING DATE: 1999-09-23 


PRIOR APPLICATION NUMBER: US 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 3 
LENGTH: 2 07 0 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-548-372D-3 

Query Match 99.9%; Score 2068.4; DB 4; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2069; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 30 0 

I I I I I I I I I I I I I I I I I I I I I I I ! I I E I I I I I I I I I I 11 I I I I I I I I I I I I I I II I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II I I I I I I I I I 
Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I II I I I I I I I I I I 
Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 48 0 

I I I I I I II I I I I II I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

Qy 481 GCT GCCATCACT GAAT CAGACAAGTT CTTCAT CAACGGCT CCAACT GGGAAGGCAT CCT G 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 481 GCT GCCATCACT GAAT CAGACAAGTT CTTCAT CAACGGCT C CAACT GGGAAGGCAT CCT G 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I 
Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I II 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 660 


Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACAGCCATCCGGCGGGAGTGGTATTAT 780 

Qy 781 GAGGT CATCATTGTGCGGGT GGAGATCAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 GAGGT CATCATTGT GCGGGT GGAGATCAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

Qy 841 T ACAACT AT GACAAGAG C ATT GT GGACAGT GG CAC C AC CAACCT T C GTT T GC CCAAGAAA 90 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 841 T ACAAC TAT GACAAGAG C ATT GT GGACAGT GG CAC CAC CAACCT TCGTTTGCC CAAGAAA 900 

Qy 901 GT GT T T GAAGCT GCAGT CAAAT C CAT CAAGGC AGC CT C CT C CAC GGAGAAGTT C C CT GAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 

Db 901 GT GT TT GAAGCT GCAGT CAAAT C C ATCAAG GCAGC CT C CT C CAC GGAGAAGTT C C CT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I I I I I I I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I i I I I ! I I I I I I I I I I I I I I 

Db 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

Qy 1081 AT C C T T C C GCAGCAAT AC CT GC GGC CAGT GGAAGAT GT GGC CAC GT C C CAAGAC GACT GT 1140 

I I I I I I II I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 

Qy 1141 T ACAAGT T T GCCAT C T CAC AGT CAT C CAC GGGC ACT GT TAT GGGAGCT GT TAT CAT GGAG 1200 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 1141 T ACAAGT T T GCCAT CT CAC AGT CAT C CAC GGGC AC T GT TAT GGGAGCT GT TAT CAT GGAG 12 00 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

Qy 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 132 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

Qy 1321 GAAGACT GT GGCTACAACATT CCACAGACAGAT GAGT CAACCCTCAT GACCATAGCCTAT 1380 

I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1321 GAAGACT GT GGCTACAACATT CCACAGACAGAT GAGT CAACCCTCAT GACCATAGCCTAT 13 80 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I II 

Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 4 0 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I LI I I I I I I I I I I I I I I I I I I 

Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 


Qy 1501 AAGT GAGGAGGC C CAT GGG C AGAAGAT AGAGAT T C C C CT GGAC CACAC CT C C GT GGTT CA 1560 

I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I 
Db 1501 AAGT GAGGAGGC C CAT GGGC AGAAGAT AGAGAT T C C CCT GGAC CACAC CT CC GT GGT T C A 1560 

Qy 1561 CTTT GGT CACAAGTAGGAGACACAGATGGCACCT GT GGCCAGAGCACCTCAGGAC CCT CC 1620 

I I I I II I I I II II I I I I I I I I I I I I I I I I I I II I I I I II II I I I I I I I I II I I I I I I I I I 

Db 1561 C T T T GGT CACAAGT AG GAGACACAGAT GGCACCT GT GGC C AGAG C AC CT C AGGAC C CT C C 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I | | | | | | | | I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I M 

Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

Qy 1681 GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACT 1740 

I | | | | | | | | II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1681 GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACT 17 40 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCA7\AGTATTCTTCTTTTCTTAGTTTCAG7\A 1860 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 

Db 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I I I I I ! I E I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 19 8 0 

I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I M I I 

Db 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 198 0 

Qy 1981 T GCT T T AGAGAC AG GGAC T GT ATAAACAAGC CT AACAT T GGT GCAAAGAT TGCCTCTT GA 204 0 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 1981 T GCT T T AGAGAC AGG GACT GT AT AAACAAGC CTAACAT T GGT GCAAAGAT T G C CT CT T GA 204 0 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 
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Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CT GC GGCT GC C C C GGGAGACC GAC GAAGAG C C C GAGGAGC C C GGCC GGAG G G GCAGCT T T 180 

I I I I I II I I I I I I I II I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 240 

I I I I I I II I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

Qy 481 G C T G C CAT C ACT GAAT CAGACAAGTT CT T CAT CAAC GGCT C CAACT GGGAAGGCAT C CT G 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 81 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGT^AGGCATCCTG 540 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 


Qy 


601 


CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 




t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 


rTnGTAAAGrAGACrCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 

660 

Qy 

661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 

720 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 


Db 

DOl 

pppptp a APPAPTPTGAAGTGPTGGPPTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 

720 

Qy 

721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 

780 


I 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 I 1 1 1 1 1 1 1 


Db 

/ z ± 

p arr APTPPPTPTAPAPAGGPAGTPTPTGGTATACACCCATCCGGCGGGAGTGGTATTAT 

780 

Qy 

781 

GAGGTCAT CATT GTGC GGGTGGAGATCAATGGACAGGAT CT GAAAAT GGACTGCAAGGAG 

840 


I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

/ o J. 

r a r pt P AT P ATT PTPrP CCGTGG AG ATP A ATGG AC AGGATCTGAAAAT GGACT GCAAGGAG 

840 

Qy 

841 

T AC AAC TAT GAC AAG AG CAT T GT G G AC AGT G G C AC C AC C AAC CTTCGTTTGCC C AAG AAA 

900 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

P A 1 
O fl ± 

t ap A APT ATPtAP AAPrAGPATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 

900 

Qy 

901 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 

960 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 


Db 

q m 

y u jl 

r m r mmm r a a p pt p p A PT P A A AT PP ATP A AGGCAGPPTPCTCCACGGAGAAGTTCCCT GAT 

960 

Qy 

961 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 

1020 


| | I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

y bi 

rr'TTTr'TPPPT A PPAPAPP APtPTPPTPtTPPTHGP A APiPAGGPAPCACCCCTTGGAACATT 

1020 

Qy 

1021 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 

1080 


I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1UZ1 

ttpppaptp ATPTPAPTPTAPPTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 

1080 

Qy 

1081 

AT C CTT C C GC AGCAAT AC CT GC GG C CAGT GGAAGAT GT GGC CACGT CCCAAGAC GACT GT 

1140 


I | | | | 1 1 1 1 ! 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 


Db 

1081 

AT C CT T C C G CAG CAAT AC CT GC GGC CAGT GGAAGAT GT GGC C AC GT CCCAAGAC GACT GT 

1140 

Qy 

1141 

T ACAAGT T T GC CAT CT C AC AGT CAT C C AC GGGCAC T GT TAT GGGAGCT GTT AT C AT GGAG 

1200 


I 1 I 1 1 M I 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 M 1 


Db 

114 1 

J. J_ Q X 

TAPAAP/TTTGPPATPTPAPAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAG 

1200 

Qy 

1201 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 

1260 

Db 

1 9 C\ 1 

1 1 1 M I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

PPrPTTPTAPGTTGTPTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 

1260 

Qy 

1261 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 

1320 


I 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 


Db 

19^1 
XZ O X 

PATnTGPAPGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 

1320 

Qy 

1321 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCCTAT 

1380 


1 I I I I I 1 1 1 II 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 


Db 

1 ^91 

X jZ J. 

pa APrAPTGTGGCTAP AAC AT T C C AC AGAC AGAT GAGT C AAC C CT CAT GAC C AT AGC CT AT 

1380 

Qy 

1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 

1440 


M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1381 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 

1440 

Qy 

1441 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 

1500 


I | I I 1 1 1 I | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 1560 

I I I I I I I 1 I I I I I I I I I I I I M I I I I 1 M I M f I M I I I I I I M I M I I M I I i I I I M I 

Db 1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 1560 

Qy 1561 CTTT GGT CACAAGTAG GAGACACAGAT G GCAC CTGT GGC CAGAGCAC CTCAGGAC C CT C C 1620 

II I I M I ! I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I M I I I I I II I I I I I I I M 

Db 1561 CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGC CAGAGCAC CTCAGGAC C CT C C 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I 1 I I I I I I I M I I I I. ! I I II I I I I M 
Db 1621 C C ACC CAC CAAAT GC CT CT GCCT T GAT G GAGAAGGAAAAGGCT GGCAAGGT GGGT T C CAG 1680 

Qy 1681 GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACT 1740 

I I I I I I i M M M M M I If II It M II M II I M M I I II M I M M M I II M It I M 

Db 1681 GGACT GT AC CT GTAGGAAACAGAAAAGAGAAGAAAGAAG CACT CT GCT GGCGGGAAT ACT 1740 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I I M I I I I I I I I I I I I M M I I I I I I I I I I I I II I I I I II I I I II I I I I II I I I I I I I M 
Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I II I I I I I I I I I I I I I II I II I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 18 01 TTGTCCACCATTCCTTTAAATTCTCCTU^CCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

Qy 18 61 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

II I I I II I I I I I I I I I M I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 

Db 1861 GT ACT GGCAT CACAC GCAGGTT AC CT T GGC GTGT GT C C CT GT GGT AC CCT GGCAGAGAAG 1920 

Qy 1921 AGAC CAAGCT T GTTTCCCTGCT G GC CAAAGT C AGT AGGAGAGGAT GCACAGTT T G CTAT T 1980 

I II I I I I I I I II I I I I I I I I I II I I I I I I I I I I I II II I I M I I M I I I I I I I II I I M i 

Db 1921 AGACC AAGCT T GT T T C C CT GCT GGC CAAAGT CAGT AGGAGAGGAT GCACAGTT T GCT AT T 1980 

Qy 1981 T GCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATT GGT GCAAAGATTGCCT CTT GA 204 0 

I I I I I M I I I I I I I II I I I II I I I I I I I I I M I I II I I I I I I I I I I I I M I I I I I II I I I 
Db 1981 TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTTGA 204 0 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I II I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 3 

US-09-551-853D-3 

; Sequence 3, Application US/09551853D 

; Patent No. 6500667 

; GENERAL INFORMATION: 

; APPLICANT; GURNEY ET AL. 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/6280L 

; CURRENT APPLICATION NUMBER: US/09/551, 853D 
; CURRENT FILING DATE: 2000-04-18 
; PRIOR APPLICATION NUMBER: US 60/155,493 
PRIOR FILING DATE: 1999-09-23 


; PRIOR APPLICATION NUMBER; US 09/404,133 

; PRIOR FILING DATE: 1999-09-23 

; PRIOR APPLICATION NUMBER: PCT/US99/20881 

; PRIOR FILING DATE: 1999-09-23 

; PRIOR APPLICATION NUMBER: US 60/101,594 

; PRIOR FILING DATE: 1998-09-24 

; NUMBER OF SEQ ID NOS : 73 

; SOFTWARE : Patentln version 3.1 

; SEQ ID NO 3 

LENGTH: 2070 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-09-551-853D-3 

Query Match 99.9%; Score 2068.4; DB 4; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2069; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 


Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I It I I II I I It I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I M I I I I I I I M I I I 
Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I II I I I M I M M I II t M I I M M M M I II I t I I M M II M M M I M M I M M M 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

t I I I I II I I I I I i I I I I I I I I I It II I I I I I I I I I I I M I I I II i I I I I I I I I I I I I I I I 
Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 2 40 

I I I I I I I 1 I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGTCGGGGCAGGGCTACTAC GT GGAGATGACC 2 40 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GT GGGC AG C C C C C C GCAGAC GCT CAACAT CCT GGT GGAT ACAGGCAGCAGT AACTTT GCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I M I I I I I I t II I I I I I I I I I It I I I I I M I I I I I II I I I I I II I 
Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I M I II I II I I I I I M I II M I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

I I I I I I I I I I I I I I I I I I I I I It I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

Qy 481 GCT GCCAT CACTGAAT CAGACAAGTT CTTCAT CAACGGCT CCAACT GGGAAGGCAT CCT G 54 0 

I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I M I I M II II II I I I II I M M II I I i I I I I I M II M I I I II I I I II II I I I I II 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 


Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I M I I I I I I II I I I I I I I 1 I I I I I M I I I I I 1 I I I 1 II I I I I I I I I I I I I I I I II I I I 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 660 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 720 

M I I I II I I I I I M I I I I I I I M I I I I I I I II I I I I I I 11 I II I I I I I I I I M M II I I I 

Db 661 CCCCT CAACCAGT CT GAAGTGCT GGCCTCT GT CGGAGGGAGCAT GAT CATT GGAGGT AT C 720 

Qy 721 GAC CACT C GCT GT ACACAGGCAGT CT CT GGT AT ACACCCAT CC GGC GGGAGT GGT AT TAT 7 80 

I I I I I I I I I I I II II II I II 1 I I I I I I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 

Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 80 

Qy 781 GAGGTCAT CATT GT GCGGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

II I I M I I I I I I I I I I I I I M II I I I I I I I M II M I I II M II M I I I I I M I I I I I I I 

Db 7 81 GAGGT CAT CATT GT GCGGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 840 

Qy 841 T ACAACT AT GACAAGAGCATT GT GGACAGT GGCACCAC CAACCTT C GT TT GCC CAAGAAA 900 

I I I I I I I I I I I I M M I I I I I I II I I M M I I II I I I I I I I I I I I I I I M I I I I M I II I 

Db 841 T AC AAC T AT G AC AAG AGC AT T GT G G AC AGT G G C AC C AC C AAC CTTCGTTTGCC CAAGAAA 900 

Qy 901 GT GTTT GAAGCT GCAGT CAAAT C CAT CAAGGCAGCCTCCT CCACGGAGAAGTT CCCT GAT 960 

I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I II I I I I I I I I I 

Db 901 GT GTTT GAAGCT GCAGT CAAAT CCAT CAAGGCAGCCT CCTCCACGGAGAAGTT CCCT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I II M I I M I I I I I I I M I I M i I If M I I I I I I I I I I I I M I II M I I I I I I I I I I M I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

I I I I I I M I I I I ! I I I I I I II I I M I I I I II II i II I I I! I I I I M I I I I I I II I II I I I 

Db 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCT^ACCAGTCCTTCCGCATCACC 1080 

Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 114 0 

I I I M I M I I M II M I M M M M M I I I M II I I II II M I M M I I I I M I I M M I 

Db 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 114 0 

Qy 1141 TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAG 1200 

I M M I M I I M I I M I I II M M M I I M I M M I M II M M M I II II M I I I I M I 

Db 1141 TACAAGT T TGCCAT C T CACAGT CATC CAC GGGCACT GTTATGGGAGCT GT TATCAT GGAG 1200 

Qy 12 01 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

I I I M I M II I I I I M I I I I M II I I I I I I I I I II I II I II I I I I I I I I I I I I I I I M 1 I 

Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 1261 CAT GT G CAC GAT GAGT T C AG GAC G G CAGC GGT G GAAG G CCCTTTTGT CAC C T T GGACAT G 132 0 

I I I M I I I I I I I I I I I II M I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M 

Db 12 61 CAT GT GCAC GAT GAGT T CAG GAC G G CAGC GGT G GAAG G CCCTTTTGT CAC C T T GGACAT G 132 0 

Qy 1321 GAAGACT GTGGCTACAACATT C CACAGACAGAT GAGT CAAC CCT CAT GAC CATAGC CTAT 1380 

M II I I I I I I I I I I I i I I M I I I I I I I I I M I I I I I II I I N I I I I I II I II I I I I I I II 

Db 1321 GAAGACT GT GGCT ACAACATT C CACAGACAGAT GAGT CAAC CCT CAT GAC CAT AGC CTAT 1380 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

M II I I I I I I M I I I I I II I I I I I M I I I I I I I I I I I M I M I I I M I I I I I I I M I I I I 

Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 


Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I M i I II II I ! I I I I I I II I I I I I II I I I I I I I I I I I I I I II I I I I I I M I M M I 
Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 


Qy 1501 AAGT GAGGAGGCCCAT GGGCAGAAGAT AGAGATTCCCCT GGACCACACCTCCGT GGTT CA 1560 

I I I I I I I II I I I M I I M II I I I I I I M I I I I I I I I I I I I I I II M I I 1 I II I M I I M I 

Db 1501 AAGT GAGGAGGCCCAT GGGCAGAAGAT AGAGATTCCCCT GGACCACACCTCCGT GGTT CA 1560 

Qy 1561 CT T T GGT CACAAGT AGGAGACACAGAT GGCAC CT GT G GC CAGAGCAC CT CAGGACC CT C C 1620 

II II I I I M I I I M I I I I II II I I II M I I I I I I I I I I I I M II I I I I I I I I M I I I M I 

Db 1561 CT T T GGT CACAAGT AGGAGACACAGAT GGCAC CT GT GGCCAGAGCAC CT CAGGACC CT C C 1620 

Qy 1621 C CAC CCAC CAAAT GCCTCTGCCTT GAT GGAGAAGGAAAAGGCT G GCAAGGT GGGTT CCAG 1680 

I I M I M II I I I I I I I M M I I II M I I I I I I I I I I I M I I M I I 1 I I II I I I II I II II 

Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

Qy 1681 GGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGC GGGAAT ACT 1740 

I I I I I II I I I II I I I I I I I I I I I I M I M I I I I I I II II I I II I I I I I I I I M II I I I I I 

Db 1681 GGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAG CACT CT GCT GGC GGGAAT ACT 174 0 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I I M I M M I II I I I I I I M I I I I M I M I I I I I M I I I M I I I 1 I I I 11 II I M I I I I I 

Db 1741 CTTGGTCACCT CAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

M II I M II I I I I I II I I I I I 1 1 I II I M I I M I M I I I I I I M I I M I I I M I I I I I I I 

Db 180.1 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

M M I I I I I II I M II I II M II M II I II I I I II II I I I II II I II I I I I II I I I I I I I 

Db 18 61 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

II M I I I I M I I II I I I I I I i I I I II M I I I I I I I I I I I I I M M I M I I I I I I I M M I 

Db 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

Qy 1981 TGCTTTAGAGACAGGGACTGTATAAACAAGCCTT^CATTGGTGCAAAGATTGCCTCTTGA 204 0 

I M M I II I I II II II I If I M I M M I M M I II II I I I I M M M I I M I M M I II I 

Db 1981 TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTTGA 2040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

M I I M M I M I M M I I M I I II I I M M 

Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 4 
US-09-009-191-1 

; Sequence 1, Application US/09009191 

; Patent No. 6319689 

; GENERAL INFORMATION: 

APPLICANT: POWELL, DAVID 
APPLICANT: CHAPMAN, CONRAD 

; APPLICANT: MURPHY, KAY 
APPLICANT: SMITH, TRUDI 
TITLE OF INVENTION: ASP2 

; NUMBER OF SEQUENCES: 6 

; CORRESPONDENCE 7VDDRESS: 


ADDRESSEE : RATNER & PRESTIA 
; STREET: P.O. BOX 980 

CITY: VALLEY FORGE 

STATE: PA 

COUNTRY: USA 

ZIP: 19482 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 
; COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/009, 191 
; FILING DATE: 20-JAN-1998 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: UK 9701684.4 

FILING DATE: 28-JAN-1997 
; ATTORNEY/ AGENT INFORMATION: 

NAME: PRESTIA, PAUL F 

REGISTRATION NUMBER: 23,031 
; REFERENCE/ DOCKET NUMBER: GH-70368 

TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-407-0700 

TELEFAX: 610-407-0701 

TELEX: 846169 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 2541 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: single 
; TOPOLOGY: linear 

MOLECULE TYPE: cDNA 
US-09-009-191-1 

Query Match 98.2%; Score 2032; DB 4; Length 2541; 

Best Local Similarity 99.5%; Pred. No. 0; 

Matches 2049; Conservative 0; Mismatches 10; Indels 1; Gaps 1; 


Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I II I I I I I I f I I M I I I I 1 I I M I I I M I I ( I i I II I I I I I M I t I M I M I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I I I I I I I I I I I II I M I I I I I I I I I 11 I I I I I I I I I i I I M II I I I I I I I I I I I I M M 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I M I I I II M I I I I I II I I I I I I I I I I I I II I I I I I I M I I II I II I I I I I I II I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Qy 181 GT G GAGAT GGT GGACAAC CT GAG GGGCAAGT C GGGGCAGGGCTACT AC GT GGAGAT GACC 24 0 

II I I I I I I I II I I I I I II I II I I I I II II I I M M I I II I I I I I I I I I I I I I I II I I I II 

Db 181 GT GGAGAT GGT GGACAAC CT GAGG GG CAAGT C GGGGCAGG G CT ACTACGTGGAGAT GACC 240 

Qy 241 GT GGGCAGC C C C C C GCAGAC GCT CAACAT C CT G GT GGATACAGGCAGCAGTAAC TT T GCA 300 

I I I I I I I I I I I M I M I II I I I I I I II I II I I I I I I I I M I I I I I I I I I I I I II II M 11 


Db 241 GTGGGCAGC C C CC CGCAGAC GCT C AACAT C CT GGT GGAT ACAGGCAGC AGT AACT TT GCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 3 60 

I I I I I I I I I I I I I I I I I I II I I I M I I I I I I I II I I I I I I I I M I I I I I I I M I I I I II I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGG7VAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I M I I I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGAGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I I I I I I I I I I I I I M I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I 1 I I I I I I I I I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

Qy 481 GCT GCCAT CACT GAAT CAGACAAGTT CTT CAT CAACGGCT C CAACTGGGAAGGCAT CCT G 54 0 

I I I I I I I I I I I I M I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I M , 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I M 11 I I I I I I I M I M I 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 660 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 720 

I I I I I I I I I I M I I I M I I I I II I i I I I I I II M I I I II II I I I I I I M M II I I 11 I II 

Db 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 720 

Qy 721 GAC CACTC GCT GTAC ACAGGCAGT CT CT G GTATACAC C CAT CC G GC GGGAGTG GT ATTAT 780 

I I I I I I I I M If I I I II I I II I II I I I I I II M II I I! I M I I II M II 11 II I I M I II 

Db 721 GAC CACTC GCT GTACACAGGCAGT CTCT G GTATAC ACC CAT CC GGC GGGAGTGGTATTAT 780 

Qy 781 GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 840 

Mill I II I I II I I I I I I I I II M M I M I II I M I M M I I M II M M I I I M I I M 

Db 781 GAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 840 

Qy 841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCC7VAGAAA 900 

I I M I I M M I M I I M I I M M M M I M M I I I M M M I M M I I M I M I M M M 

Db 841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 900 

Qy 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

I I I I I I M I II M M I II I M I II M I M M I M I M I M I I I I I I I I I I I II I I I M I I 

Db 901 GT GT T T GAAG C T GC AGT C AAAT C CAT C AAGGC AG CCT CCT C C AC GGAGAAGT T C C C T GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I M I I I I I M M I I I I I I II I M I I I I I I M M I I I II I I I I I II I I I M I I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 T T CC CAGT CAT CT CACT CT ACCTAAT GGGT GAGGTTACCAAC CAGT C CT T C C GCAT CACC 108 0 

I I I I I I I I I I I I II I II I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I II I I I 

Db 1021 TTCCCAGT CATCT CACT CTACCTAATGGGT GAGGTTACCAAC CAGT C CTTCC GCAT CACC 1080 

Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 

I II I M I M I I I I I I I I I I I I I I I I II I M II I I I M M I I M I I I I I I I I I II I I I I I I 

Db 1081 AT CCTT C C GCAGC AAT AC CT GC GGC CAGT GGAAGAT GT GGC C AC GT C CCAAGACGACT GT 1140 


Qy 1141 T ACAAGTTT GCCAT CT CACAGTCAT CCACGGGCACT GTTAT GGGAGCT GTTAT CAT GGAG 1200 

f M I I I I I M I M I I M I I II I II I M I M I I I M I I It II I If I I I M I I t t I I I M I I 

Db 1141 TACAAGT T T GCCAT CT CACAGT CAT CCACGGGCACT GTTAT GGGAGCT GT TAT CAT GGAG 1200 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I M I I I I I I M I I I I I N I M I 1 I I II I I I I I I M I I I I ! I I I I I II I I I II II I I I I 
Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 12 61 CAT GT G C AC GAT GAGT T C AGGAC GGC AGC GGT GGAAGGC CCTTTTGT C AC CT T GGAC AT G 1320 

I M I I I I M M M I I I I II I I II M I I I I M I I I I I I M I I M M ! I I I I I I II I II M I 
Db 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

Qy 1321 GAAGACT GT GGCT ACAACATT CCACAGACAGAT GAGT CAACCCT CAT GAC CAT AGCCT AT 1380 

M I I II II I I I I I M I I I I I M I I I I I I I I I II I I I I I I II I I I I I I I II I I M I I II I I 
Db 1321 G AAG AC T GT G G CT AC AAC AT T C C AC AG AC AG AT GAGT C AAC C C T CAT GAC CAT AG C C TAT 1380 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 4 0 

I I M I I I I I I I II I M I I I I I M I I I I I I I I I M I 1 I I II II 11 M I I I I I I I I I I I M I 
Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

Qy - 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I M I II M I I M I II II I I I I II I M I M II I I I I I I I I II I I M I I I I I I I M I I I 

Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCT-GGACCACACCTCCGTGGTTC 1559 

I I I I I I M I I I II I M I I I I I I II I I I M I I I II I I I I I M I I I 1 M I I I I II M I 
Db 1501 AAGT GAGGAGGCCCAT GGGAGAAAGATAGAGATT CC C CTGGGAC CACAC CT C C GT GGTT C 1560 

Qy 1560 ACTTT GGTCACAAGT AGGAGACACAGAT GGCACCT GT GGC CAGAGC ACCT CAGGACCCT C 1619 

M I I M I I II I ! M I I M I M I I I I I I I I I I I I I II I I I II M I I I I M I I M I I I M I I 
Db 1561 ACTTTGGTCACAAGT AGGAGACACAGAT GGC ACCT GT GGCCAGAGCACCT CAGGAC CCT C 1620 

Qy 1620 CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCA 1679 

I I I II I 11 I I I M I M I II I I M I I M I I II I I I M I II I I I I I I I II I I M II I I I I I I 
Db 1621 C C C AC C C AC C AAAT GCCTCTGCCTT GAT GGAGAAGGAAAAGGC T GGC AAGGT GGGTT C C A 1680 

Qy 1680 GGGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGC GGGAAT AC 1739 

M M I I I I M I I M M I I I I M I I II II M I I M I I I I I I II M I II I I II II II I II M 
Db 1681 G GGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGCGGGAAT AC 174 0 

Qy 1740 TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACC 1799 

I M II I M M 1 M I I 1 I I II 1 I I II 1 M I 11 1 I I I I II 11 1 M I I I I I I I I M I I I I II I 

Db 1741 TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACC 1800 

Qy 1800 TT T GT CCAC CAT T C CT TT AAAT T CT CCAAC C CAAAGT AT TCTTCTTTT CT TAGTT T CAGA 1859 

M I I I I I II II M M I M I I I I I M II II I II I M I I M I I I I M I M I I I I I M I I I M 

Db 18 01 TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 1860 

Qy 18 60 AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAA 1919 

I I II I I I I I I I I M I I M I I I I I II I I I I I I I I M I I I I II I I I I II I I I MINIM! 

Db 1861 AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCGGGCAGAGAA 1920 

Qy 1920 GAGAC CAAGCT T GTTTCCCTGCT GGC CAAAGT CAGT AGGAGAGGAT GCACAGT T T GCTAT 1979 

II I II I I M I I II I I I I I II I II I I II I II M I I I II M I I I M I M I I I I I II I I I I M 

Db 1921 GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTAT 1980 


Qy 1980 TTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGC7\AAGATTGCCTCTTG 2039 

I I I! I M I I I I I I I I I I I I I I I I I I I i I I I M I M 1 I I I I M M I I I I M I I I I I I I I I I 

Db 1981 T T GCT T T AGAGACAGGGACT GT AT AAACAAGC CT AACAT T G GT GCAAAGAT TGCCTCTTG 2040 

Qy 2040 AATTAAAAAAAAAAAAAAAA 2059 

I I I I I M I I I I I I II I I 
Db 2041 AAT T AAAAAAAAAAACT AGA 2060 


RESULT 5 
US-09-604-608-1 

; Sequence 1, Application US/09604608 

; Patent No. 6545127 

; GENERAL INFORMATION: 

; APPLICANT: Tang, Jordan J.N. 

; APPLICANT: Lin f Xinli 

; APPLICANT: Koelsch, Gerald 

TITLE OF INVENTION: Catalytically Active Recombinant Memapsin and Methods 
; TITLE OF INVENTION: of Use Thereof 
; FILE REFERENCE: OMRF 17 9 

; CURRENT APPLICATION NUMBER: US/09/604 , 608 

; CURRENT FILING DATE: 2000-06-27 

; PRIOR APPLICATION NUMBER: 60/141,363 

; PRIOR FILING DATE: 1999-06-28 

; PRIOR APPLICATION NUMBER: 60/168,060 

; PRIOR FILING DATE: 1999-11-30 

; PRIOR APPLICATION NUMBER: 60/177,836 

; PRIOR FILING DATE: 2000-01-25 

; PRIOR APPLICATION NUMBER: 60/178,368 

; PRIOR FILING DATE: 2000-01-27 

; PRIOR APPLICATION NUMBER: 60/210,292 

; PRIOR FILING DATE: 2000-06-08 

; NUMBER OF SEQ ID NOS : 31 

; SOFTWARE: PatentlnVer. 2.1 

; SEQ ID NO 1 

LENGTH: 3252 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-604-608-1 

Query Match 97.1%; Score 2010; DB 4; Length 3252; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2013; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 


Qy 4 0 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 99 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I M II I I I II 

Db 1 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 60 

Qy 100 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 159 

I I I I I I I I I I M I I I I I I I M I I I I I I I I I I II I I I II I I I I I I I I 1 I I I I I I I II I I I I 
Db 61 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 120 

Qy 160 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 219 

I I I II I I I I I I I I I I I I I I I M I 1 I I I I M I M I I I I I I I I II I I II I I I I I I I I II I I I 

Db 121 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 180 

Qy 220 GGCT ACT AC GT GGAGAT GAC CGT GGGCAG C CC CC C GCAGAC GCT CAACAT C CT G GT GGAT 279 


I I I I I ) I I I I M I I I II II I I I I M M I I M I 1 I I I I I I I I I i I I I M I M I I M I I I I I 

181 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 240 

28 0 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 339 

I I I I I I II I I I I I I I II I I I I I II I I II I I I I I I II I I I II I I I I II I M I I I I I II M I 

241 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 300 

34 0 CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 399 

I I I M I I I M I II I II I M I II I I I I I I I I I M I I I II II 1 M I I I I I I I II I I I I I I M 

301 CAGAGGCAGCT GT C CAGCACAT AC C G GGAC CT C CG GAAGGGT GTGT AT GT GC CCT ACAC C 360 

400 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 4 59 

I I I I I I I I I I I I I I I M I II M I I I I I I I I I I II I I I 1 I II I I I I I I I I I I I I I I I I I M 
361 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 420 

4 60 GT CACT GT GCGT GCCAACATT GCT GC CAT CACT GAAT CAGACAAGTT CTT CATCAACGGC 519 

I I I I I I I I I I I I I I I I II I I I I I I M I I I I I II I I I I II I I I I I I II I II M I I I I II M 

421 GT CACT GT GCGT GC CAACATT GCT GC CAT CACT GAAT CAGACAAGTT CTT CAT CAAC GGC 480 

520 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 579 

I I I I II M I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 II I I I M I I 
481 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 540 

580 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 639 

M I I II I I I I I M M I I I I I I I I I M I I I I I I M II M II I M M I M M I II I I I I M I 

541 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 600 
640 CACCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 699 

I i I I M I I If I I II I I M I I I I I M I II M I I M I I I II II i I I II I I I I I I I I I I I M 

601 CAGCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 660 
700 AGCATGATCATTGGAGGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCC 759 

I I I I II I M II I I M M I I II II I I M I I M I M II II I I I I M I M I M I I M I MM I 

661 AGCATGATCATTGGAGGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCC 720 
760 ATCCGGCGGGAGTGGTATTATGAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGAT 819 

I I I I I I I I II I I I 1 I I I I I I I I I I M II M I I M M II M II I I I I I I M I M I II M I 

721 AT CCGGC GGGAGTGGTAT TAT GAGGT GAT CAT T GT GCGGGH GGAGAT CAATGGACAGGAT 780 
820 CTGAAAATGGACT GCAAGGAGTACAACTATGACAAGAGCATT GT GGACAGTGGCAC CAC C 879 

M I I I M M II I M M I I I M I II II II II M I I II II II I M II M II I II II I II II I 

781 CT GAAAAT GGAC T GCAAGGAGTACAACTAT GACAAGAGC ATT GT GGACAGT GGCAC CAC C 84 0 

880 AAC C TT CGT TT GCC CAAGAAAGT GT T T GAAGCT GCAGT CAAAT CCAT CAAGGCAG CCT C C 939 

II I I I I i I I II I I I II I I I I II I II I I I II I I II I I I I I II I I M I I I M I II I I II I II 

841 AAC CTT C GT TT GCC CAAGAAAGT GTT T GAAG CT GCAGT CAAAT CCAT CAAGGCAG CCT C C 900 

94 0 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 999 

II II II I II I I II I II I M I II I I I I I I I II II II M I I I I I M II I M I I I I I I II I I I 

901 T C CAC GGAGAAGTT C C CT GAT GGTTT CT G G CT AGGAGAGCAGCT GGT GT GCT GGCAAGCA 960 

1000 GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 1059 
I M I M I 1 I I I I I I II I M I I M I I I I I I M I II M M I I I I M M M II I II M II I I I 

961 GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 102 0 

1060 AAC C AGT CCTT C C GCAT CACCAT C CT T C C GC AGCAAT AC CT GC GGC CAGT GGAAGAT GT G 1119 
I II I II I I II I I I I I I II I I I M I I I I I I I I I I I II I I I I II II I I I I I II II I II I I I I 


Db 1021 AACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTG 1080 

Qy 1120 GCCACGTCCCAAGACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTT 117 9 

I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I II I I II I I I I i I I I I I I I M I I I I I I II 
Db 1081 GC CAC GT C C C AAGAC GACT GT T AC AAGT T T GC CAT CT C AC AGT CAT C C AC GGGCAC T GT T 114 0 

Qy 1180 AT GGGAGCT GTT AT CATGGAGGGCT T CT AC GT T GT CT T T GAT CGGGCC CGAAAAC GAAT T 1239 

I I I I I I II I I M I I I I I I I I M M I II I I I I I I I I I i I I M I I I I I I I (I I I I I I I I M I 
Db 1141 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1200 

Qy 1240 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 1299 

I II I I 1 I ) I I I I I I I I I I I I I I I I I I II I i I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I 

Db 1201 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 1260 

Qy 1300 CCTTTT GT CACCTT GGACAT GGAAGACT GT GGCTACAACATT C CACAGACAGAT GAGT CA 1359 

II I I I I I I I I M M I I II I I I I I I M I I I I I I II I II M I i I I I I II I I I I M I M II I I 

Db 1261 CCTTTTGT CACCTT GGACATGGAAGACT GT GGCTACAACATT CCACAGACAGAT GAGT CA 1320 

Qy 1360 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1419 

I I I I I I I M I I M I II M I II I I I I I I I I M I I M I I I j I I I I M I I I I I I I I I II I I 1 I 

Db 1321 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1380 

Qy 1420 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 1479 

II I I I I I II I I II I I I M I I I M I I I I I I I I I II I I I I M I I I I M I I I I I I I I I I I I I I 

Db 1381 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 1440 

Qy 1480 GCT GAT GACAT CT CC CT GCT GAAGT GAGGAGGC C CAT GGGCAGAAGAT AGAGATT CC C CT 1539 

I M I I I I I I M I I I II I II M I II I II I I II I I I II II I M I I I II I II M I I I I II I I I 

Db 14 41 GCTGATGACATCTCCCTGCTGAAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCT 1500 

Qy 1540 GGAC CACAC CT C C GT GGTTCACTTT GGT CACAAGT AGGAGACACAGATGGCAC CTGTGGC 1599 

II I M II M I I M II I M M M M II M I I I M I I I II M I M II II I II I M I M M I I 

Db 1501 GGACCACACCTCCGTGGTTCACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGC 1560 

Qy 1600 CAGAGCAC CT CAGGAC C CT CCCCAC C C ACCAAAT GCCT CT GC CTT GAT G GAGAAGGAAAA 1659 

I I I M I I I I I I I II I I I I I I I I II I II I I I I I I I i I I I II I M I II I M I I I II I M I I I 
Db 1561 CAGAGCACCTCAGGACCCTCCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAA 1620 

Qy 1660 GGC T GGC AAGGT GGGT T C C AGGG AC T G T AC C T G T AGG AAAC AGAAAAGAGAAGAAAGAAG 1719 

I II M I I M I II I I I II I M I I I M I I M I I I II II I II I I I I II M I I I I M I I I I I M 

Db 1621 GGCT GGCAAGGT GGGTT CCAGGGACT GTACCT GTAGGAAACAGAAAAGAGAAGAAAGAAG 168 0 

Qy 1720 CACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCT 1779 

I II I I II M I I M M I I I II M I I I I I I I I II M II I M I II M I I I I M I I M I I I M I 

Db 1681 CACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTT7KAGTCGGGAAATTCTGCTGCT 174 0 

Qy 17 80 TGAAACTTCAGCCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATT 1839 

I M I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
Db 1741 TGAAACTTCAGCCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATT 1800 

Qy 1840 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 1899 

I M I I I II I I II I I I I I I I M I I I I I II I I II I I II I I I M I M I I I I I II I I I II M I I 
Db 1801 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 1860 

Qy 1900 TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1959 

I I I I I I I I I I II I II I I I I II I I I I I I II II I II I 1 I I I I I I I I II I 1 I I I I I I I I I I I I 
Db 1861 TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1920 


Qy 1960 GAGGATGCACAGT TT GCTAT T T GCTTTAGAGACAGGGACT GTATAAACAAGC CT AACATT 2019 

I I I II II I I M I M I I I I f I I M I M I I I I I I I I ( II t I It (I I I I 1 I I t I M I t 1 M M 

Db 1921 GAGGATGCACAGTTTGCTATTTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATT 1980 

Qy 2020 GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAAAAAAA 2057 

I I I I M I I II M II M I II I I II M I I I M I M I I 

Db 1981 GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAACTAGA 2018 


RESULT 6 

US-09-548-372D-5 

; Sequence 5, Application US/09548372D 

; Patent No. 6420534 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL. 
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; TITLE OF INVENTION: THEREOF 
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; CURRENT FILING DATE: 2000-04-12 

; PRIOR APPLICATION NUMBER: US 60/155,493 
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; PRIOR APPLICATION NUMBER: US 09/404,133 

; PRIOR FILING DATE: 1999-09-23 

; PRIOR APPLICATION NUMBER: PCT/US99/20881 

; PRIOR FILING DATE: 1999-09-23 

; PRIOR APPLICATION NUMBER: US 60/101,594 
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; NUMBER OF SEQ ID NOS : 73 
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Query Match 91.2%; Score 1887.2; DB 4; Length 1977; 

Best Local Similarity 96.2%; Pred. No. 0; 

Matches 1974; Conservative 0; Mismatches 3; Indels 75; Gaps 1; 


Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

M I I II I M M I I I I I M I M I I I M M I M I I M I I I I I M M I II I M I M I II I I M 

Db ' 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

II I I I I T I I I II I I I I II 1 II I I I I I I I I I M I M I I M I I M I I I M I I I M I I I 1 I I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I II I I I M M I II I I M I I I I I I M I I I I I M M I I I II I II II I I I I I M M I I I I M 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 240 

I M I II I I I I II I I I I I I 1 I I I I I 1 I I II I I I I I I I I I I M I I I I I I M I I I I I I I I I II 
Db 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 240 


241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

i I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I 

241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

M I I I I I M M I I I I I I t I I I I I I I I I I I I I I I M I I I I I I 1 I I I 1 I I M I I I 1 I M M I 

301 GTGGGTGCTGCCCCC CAC C C CTT C CT GCAT C GCT AC TAC CAGAGGCAGCT GT C CAGCACA 3 60 

361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I 

361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

M M M I I II I I II I I I I I M M I t M It I M I I I I I I I I I I I I I II II I M I M I I! M 

421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 540 

I I I I I I I I I II I I ! I I M I f II I M II M I I I II ! I I I I I I I I I I I I I I I I M I i I I I I I 

481 GCT GC CAT C ACT GAAT CAGACAAGTT CTT CAT CAAC GGCT C CAACT GGGAAGGCAT C CT G 540 

541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

II I I I I I II M I I II II I I I I I I I M I 

541 GGGCTGGCCTATGCTGAGATTGCCAGG 567 

601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I M 1 I ! I M I M I I 1 

568 CTTTGTGGTGCTGGCTTC 585 

661 CCCCT CAACCAGT CT GAAGT GCT GGCCT CT GT CGGAGGGAGCAT GATCATT GGAGGTAT C 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I II I 

586 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 645 

721 GAC C ACT C G C T GT AC AC AG G C AGT C T C T GGT AT AC AC C CAT C C G G C GG GAGT G GT AT TAT 780 
I I I I II I I I II I II I I I I II I II I I I I I I I I II I I II I I II II I M I I I I I II I II I I I I 

646 GACCACTCGCT GTACACAGGCAGT CT CTGGTAT ACACCCAT CCGGCGGGAGT GGTATTAT 705 

781 GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 840 

I I I I I II I I II I I M II I I I I I I i I I 11 I I I I I I I I I I M I I I I M I I I M I I I M I I I 

706 GAGGT GAT CATT GT GCG GGT GGAGATCAAT GGACAG GAT CT GAAAAT GGACT GCAAGGAG 7 65 

841 TACAACTAT G ACAAGAGC ATT GT GGACAGT GGCAC CAC CAAC CTT C GTTT GCC CAAGAAA 900 

I I I I I II I I I M I 1 M II II 1 M I I M M I II I I M I I I M I I 1 I M I II I I I I M I I I I 

7 66 T AC AAC TAT GAC AAGAGC AT T GT GGACAGT GGCAC CAC CAAC CTTCGTTTGCC CAAGAAA 825 

901 GT GTTT GAAGCT GCAGT CAAAT CCAT CAAGGCAGCCT CCT CCACGGAGAAGTT CCCT GAT 960 

I I I M II II I I I I M II I I II I M I I I I I I II I I I I I I I I I I I I I I M I I I I M I I M M 

826 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 885 

961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I II I I I I I I I I I I I I I I I II I I I I I I I I I I I (I I I I I I (I I I I I I I I I I I I I I I I I I I I I 

886 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 945 

1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

946 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1005 


Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 114 0 

I I I I I I I I I I I I M I I I II I M I I I I M I I I 1 I I I I I I I I M I I I I I I I I I I I I I M I I I 

Db 1006 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1065 

Qy 1141 T AC AAGT T T GC CAT C T CAC AGT CAT C C AC GG GC AC T GT TAT GG GAGCT GT TAT CAT GGAG 1200 

I I I I I I I M I I II I M 1 M I If I I I I I I I If I I I I I I I I I I I I I I I | | I M I I I H I I I I 
Db 1066 T ACAAGT T T GCCAT CT CACAGT CAT C CACGGGCACT GTTAT GGGAGCT GT TAT CAT GGAG 1125 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I M I I I I I I I I I I I I I I I I I M M I I M I I I I! I I I I I I II I M I I I I I I I I I II I I I 

Db 1126 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1185 

Qy 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

M I I 1 I I I M I I M I I I I II M I I I I 1 M M I M I M I I 1 I M II I I 1 I I II M 1 I M it 

Db 118 6 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 124 5 

Qy 1321 GAAGACT GT GGCTACAACAT T CCACAGACAGAT GAGT CAAC CCT CAT GAC CAT AGCCT AT 1380 

I I II I M I I I I I I I I II I I II I I I M M I I I M I I I M I M I I I I II I M M I II I I I M 

Db 1246 GAAGACT GT GGCTACAACATT CCACAGACAGAT GAGT CAAC CCT CATGACCAT AGCCTAT 1305 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

It I I II I I I I I I M It M II M I II II II I I I 1 I I I I M I I I I M I I I I M I I I I I II I I 

Db 1306 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1365 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I M M M I I M M I M I M I I I I M M I M M I I M I I I M I I I II M M II M M M I I 

Db 1366 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1425 

Qy 1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 1560 

1 M I M I I I I I I I I I I I I I II I I I I I I I I 1 I I I I I I I 1 II I I I I 1 I I I I 1 II 11 M I I I I 

Db 1426 AAGT GAG GAG GC C CAT GGGC AGAAGAT AGAGAT T C C C C T GGAC CAC AC C T C C GT GGT T C A 14 85 

Qy 1561 CTTT GGT CACAAGTAGGAGACACAGAT GGCACCT GTGGCCAGAGCACCT CAGGACCCT CC 1620 

II M M M I! I I II II I M I II I I I I I I I I II I I I I I I I I M I II M I I f I I I II I I M I 

Db 1486 CT T T GGT CACAAGTAGGAGACACAGAT G GC AC CT GT G GC CAGAGC AC CT CAGGAC CCT C C 1545 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I I I I M I I I I II I I I I I M I M I M I II II I I I I I I I I I I I II I I I I M I I I I I I I I I I I 

Db 1546 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1605 

Qy 1681 GGACT GTACCT GTAGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGCGGGAATACT 174 0 

I It I I I I M I M I I I I M M I I I I M I I II M If M I I I I I I I II I I M I I I I I M I II I 
Db 1606 GGACT GTACCT GT AGGAAAC AGAAAAGAGAAGAAAGAAGCACT CT GCT GGC GGGAAT ACT 1665 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

M M I I I I I I I I M I I II I I I I I II I II I I II I I I I I I I 1 I II I I I I 1 I M I I I I I II I I 

Db 1666 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1725 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I I I I II I f I I I M I' I I I I f I II I I I f I I I I I I I f I I I f I I II f I M II f II I I I I I II ( I 
Db 1726 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1785 

Qy 1861 GT ACT GGC AT C AC AC GCAGGT T ACCTT GG C GT GT GT C C CT GT GGT AC CCT G GC AGAGAAG 1920 

I I I M I I I I II II I I I I I I I I I I II I II II I I I I I I I I I I M I I I I I M I I II I I I I I I I 

Db 1786 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1845 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 


Db 


I I I I M I II I I M I II I M M M I I I M M M I II M M I I I I I I I I I I I f M I M I t M 

1846 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1905 


Qy 1981 T GCT TT AGAGACAGGGACT GT AT AAACAAGCCT AACATT GGT GCAAAGATT GC CT CT T GA 2040 

I I I M II I M I I I M I I I I M I I I I I I I I M II I I I I I I I M I I I II I I M I I I I I I I I I 

Db 1906 TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTTGA 1965 

Qy 2041 ATTAAAAAAAAA 2052 

I I I M I I II I 
Db 1966 AAAAAAAAAAAA 1977 


RESULT 7 

US-09-548-367D-5 

; Sequence 5, Application US/09548367D 

; Patent No. 6440698 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL. 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/6280H 

; CURRENT APPLICATION NUMBER: US/ 09/ 54 8 , 367D 

; CURRENT FILING DATE: 2000-04-12 

; PRIOR APPLICATION NUMBER: US 60/155,493 

; PRIOR FILING DATE: 1999-09-23 

; PRIOR APPLICATION NUMBER: US 09/404,133 

; PRIOR FILING DATE: 1999-09-23 

; PRIOR APPLICATION NUMBER: PCT/US99/2 08 8 1 

; PRIOR FILING DATE: 1999-09-23 

; PRIOR APPLICATION NUMBER: US 60/101,594 

; PRIOR FILING DATE: 1998-09-24 

; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 5 

LENGTH: 1977 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-09-548-367D-5 

Query Match 91.2%; Score 1887.2; DB 4; Length 1977; 

Best Local Similarity 96.2%; Pred. No. 0; 

Matches 1974; Conservative 0; Mismatches 3; Indels 75; Gaps 1; 


Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I M I I I I M M I M I I I I I M M I I I I I I M I M II I I I I I II I II 1 I I I 1 I I I I II II 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

II I M I I II M I II I I M I II I I M I I I I I M I I II I I I I I I II I I II I I I II I I I II I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I M II I I M M M I I I M I I M I I M I I M M II I I I M I M I I M I I II M M I M II 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Qy 181 GT GGAGAT GGT GGACAAC CTGAGGGG CAAGT C GGGGC AGGGCT ACT ACGT G GAGAT GAC C 24 0 


I I I II M I 1 I II t M II M II I M I I M I I M I i I I M M M I I M M I M I I M I I I M 

Db 181 GT G GAGAT GGT GGACAAC CT GAGGGGCAAGT CGGGGCAGGGCT ACT ACGT GGAGAT GACC 24 0 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I M M I I M M M M I I I M M M M M I I I M M I M I I II I I I M I I M I I I M I M I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 3 01 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I M M I M I I I II I M I I I I I I I ! I II I I I I I I I I I I I I I I I I I I I I I I II M I I M I I I 
Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I M I M I I M II I I M I M I I M I I I I II I I I I I I I I II I I I I I M II I I I I I I M I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

II I I M I I I I M I I I I I M I I I I M I I II I I I M I I I I II I 1 I I M I I I I I I M I I M M 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

Qy 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 540 

II I M I I I I M I I I I I I II M I I I I II I I I I I I I I II II I I I I M I I I I M II I I I I I I I 

Db 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 540 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I II I I I II I I II II I I I I M I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGG 567 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I M I I I I I 1 I I 1 I II I 

Db 568 CTTTGTGGTGCTGGCTTC 585 

Qy 661 C C C C T C AAC C AGT C T GAAGT GCTGGCCTCTGTCG GAG GGAGCAT GAT CAT T GGAGGT AT C 72 0 

I II I I I I M M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I i II I I I I II M I II II I 
Db 586 CCCCT CAACCAGT CT GAAGT GCT GGCCT CT GT CGGAGGGAGCAT GAT CATT GGAGGT AT C 64 5 

Qy 721 GACCACTCGCT GTACACAGGCAGT CT CT GGTATACACCCAT CCGGCGGGAGTGGT ATTAT 780 

M II I I M M I I I II I I I I M I I I II I 1 I I II I I M I I I I I I I I II II I I I I I II I M I I 

Db 646 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 05 

Qy 781 GAGGT CAT CATT GT GCGGGT GGAGAT CAAT GGACAGGATCT GAAAAT GGACT GC AAGGAG 840 

I I M I I I M I I II I I I I I I II I I I M II I I M I I I II II I M I M I I I I I I I II I I I I I 

Db 7 06 GAGGT GAT CAT T GT G CGGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 7 65 

Qy 841 T AC AAC TAT G AC AAG AG CAT T GT G G AC AGT G G C AC C AC C AAC CTTCGTTTGCC C AAG AAA 900 

II II I I I M M I I I I I II I I I I I I I I I I I I I II I 1 I II I II I I II II I I I I I I I II I I M 

Db 766 T AC AAC TAT G AC AAG AG CAT T GT G G AC AGT G G C AC C AC C AAC CTTCGTTTGCC C AAG AAA 825 

Qy 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

I I I II I I I I II M I I I I I I II I M I I I I I I I I I I I I I I I I I I I II I I I I I I I II II I I I I 

Db 826 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 885 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I M I I I I I I I I I I M I I I II II I I M II I I M II I I I I I I I I I I I I M I I I I I I I II 

Db 8 86 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 945 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

II M I I M I I I I I I I I I I I M I M I I I M I I I I II I I M I I I I I I I I I I I I I I II I I I I I 


946 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1005 

1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 

I I M I I I M I I I I I M I I I I 1 I I I I II I I I I II I II I I M I I I i I I I I I I II M I I I M I 

1006 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1065 

1141 TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAG 1200 

1 I I I M I I I II I I I I I I I I I M M 1 I I I II M II I I I I II I II I II I I I M I II I M I M 

1066 TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAG 1125 

1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 1 M M 

1126 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1185 

1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

M II II I I II I I I I M I II I M I M I M I M I I M I I M I If I I I I M M I I I I! M I II 

1186 CAT GT GCAC GAT GAGTT CAGGAC GGCAGC GGT GGAAGGC C C T TTT GT C AC CT T GGACAT G 1245 

1321 GAAGACT GT GGCTACAACAT T CCACAGACAGAT GAGT CAAC C CT CAT GAC CATAGC CTAT 1380 

I I I I I I! I I I I I I I I I II II II I II I I I I I M II I II I I I I I ! I I II I I I M I I I II I I I 

1246 GAAGACT GT GGCTACAACAT T CCACAGACAGAT GAGT CAAC C CT CAT GAC CATAGC CTAT 1305 

1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

I I I M I II I M I M I I M I I I M I M I I I I I M I M I I I M I M I I M I I II I II I M I I 

1306 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1365 

1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I M I I I I I I I II I I I II I I I I I I M I I II H I I II I I M M M I M M I II I II I I I I 

1366 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1425 

1501 AAGT GAGGAGGCCC AT GGGCAGAAGAT AGAGATT CCCCT GGACCACACCT CCGT GGTTCA 1560 

I I I I M I I I I I I I I I I I I II I I I I I I I I I II M I I M I M I I I I I I I I I I I I I I I I I II I 

1426 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 1485 

1561 CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCC 1620 

I I I I I I M I I I I I M II II I I I I I II I I I M I II I I I I I II M I II I M M M I I I I II I 

14 86 CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCC 1545 

1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I II I I I I I I I I I I M I I I I I M M M I I 1 M I I II I II I I I I M I I I II I I II I II II I I 

1546 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1605 

1681 GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACT 174 0 

I I M I 1 I I I I I I II I I I I 1 M 1 M I I I 1 I I I M I I I I M I I I M I I I I 1 M I M M I I I I 

1606 GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACT 1665 

1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I M M I I I I I I I II I I 1 I I I I M I I I I I I II M I I I M I I M M I I M I I I I I II I II I I 

1666 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1725 

1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

I I I M I I M I I I II II I I I M M I M M I I I II I I I I I I I M M II I I II I II I II M I I 

172 6 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1785 

1861 GT ACT GG CAT CACAC GCAG GT T AC CT T GGC GT GT GT C C CT GT GGT AC CCT G G CAGAGAAG 1920 

I II M I M II I I I I I I II I II II I II M II M II I I I I I I M I M I II M II I I II M I I 

1786 GTACT GGCAT CACACGCAGGTTAC CTT GGCGT GT GT CCCT GT GGTAC CCT G GCAGAGAAG 1845 


Qy 

1921 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 

1980 


1 1 1 1 1 1 1 1 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! M 1 1 1 1 M 1 M M 1 1 1 1 


Db 

1846 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 

1905 

Qy 

1981 

T GCTT T AGAGACAGGGACT GTAT AAACAAGC CT AAC AT T GGTGCAAAGAT TGCCTCTT GA 

2040 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

1906 

T GCTT T AGAGACAGGGACT GTAT AAACAAGC CTAAC AT T GGTGCAAAGAT TGCCTCTT GA 

1965 


Qy 2041 ATTAAAAAAAAA 2052 

I I I I I I I I I I 
Db 1966 AAAAAAAAAAAA 1977 


RESULT 8 

US-09-551-853D-5 

; Sequence 5, Application US/09551853D 

; Patent No. 6500667 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER 1 S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/6280L 

; CURRENT APPLICATION NUMBER: US/09/551 , 853D 
; CURRENT FILING DATE: 2000-04-18 

PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 09/404,133 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US99/2 08 8 1 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 5 

LENGTH: 1977 
; TYPE: DNA 

; ORGANISM: Homo sapiens 
US-09-551-853D-5 

Query Match 91.2%; Score 1887.2; DB 4; Length 1977; 

Best Local Similarity 96.2%; Pred. No. 0; 

Matches 1974; Conservative 0; Mismatches 3; Indels 75; Gaps 1; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60. 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I | I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 121 CTGCGGCTGCC C CGGGAGAC C GACGAAGAGC C C GAGGAG C C C GG C C GGAGGGGCAGCTT T 180 


Qy 181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGAT GACC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I M I 
Db 181 GTGGAGAT GGT GGACAACCT GAGGGGCAAGTCGGGGCAGGGCTACTACGT GGAGAT GACC 24 0 

Qy 241 GT G G G CAGC C C C C C GC AGAC GCT CAACATCCT GGT GGAT ACAGGC AGCAGT AACT T T GC A 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I M I I I I 
Db 241 GT GGGCAGCCC CCC GCAGACGCT CAACATCCTGGTGGATACAGGCAGCAGTAACTTT GCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I M I II I II I I I I II I I II I I I I I M I I I I I I I I I I I I I I I I I I I I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

I | I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 361 T AC CGGGAC CT CCGGAAGGGT GT GTAT GTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 48 0 

I | | | M I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

Qy 481 GCT GC C AT C AC T GAAT CAGACAAGT T CT T CAT CAAC GGCT C CAACT GGGAAGGCATC CT G 54 0 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GCT GC CAT C ACT GAAT CAGACAAGT T CT T CAT CAAC GGCT C CAACTGGGAAGGCATC CT G 540 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGG 567 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I II I I 

Db 568 CTTTGTGGTGCTGGCTTC 585 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 720 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I 

Db 58 6 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 64 5 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

I I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 64 6 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 705 

Qy 781 GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 7 06 GAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 765 

Qy 841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 900 

I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7 66 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 825 

Qy 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

I I I I I II I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

Db 82 6 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 885 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I | M I I I I II I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

Db 886 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 945 


Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 108 0 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II I 

Db 946 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1005 

Qy 1081 AT CCT T CC GCAGCAAT ACCT GC GGC C AGT GGAAGAT GT GGCCAC GT C C CAAGAC GACT GT 1140 

M II I M I II M I I M M i I I I M M M I M I M II I M M M II II M I II M I M M I 

Db 1006 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1065 

Qy 1141 TACAAGT T T GC CAT CT CACAGT CAT C C AC GGGCACTGT T AT GGGAGCT GTT AT CAT G GAG 1200 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I II I 
Db 1066 TACAAGT TT GC CAT CT CACAGT CAT C CAC GGGCACT GTTAT GGGAGCT GTTAT CAT GGAG 1125 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II II I I I I I I I I I I I I I 

Db 1126 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1185 

Qy 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

M I I I I I I I I I II I I I I I I M I I I II I M II I I I I M II I I I I I I I I I M I II I II II I I 
Db 1186 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1245 

Qy 1321 GAAGACT GT GGCTACAACATT CCACAGACAGAT GAGTCAACCCTCATGACCATAGCCTAT 1380 

I I M I I I I I I I I M I II I I I I I II I I I I I I I I I I I I II I I II I I I I II I II I I I I I I I I I 
Db 1246 GAAGACTGT GGCTACAACATT CCACAGACAGAT GAGTCAACCCTCAT GACCATAGCCTAT 1305 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 40 

I I I I I I I I I I II I I I I I I I M I M M II I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 1306 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1365 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I II I I I I I I I I II I M M I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I II I I I I 
Db 1366 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1425 

Qy 1501 AAGT GAGGAGGC CCAT GGGCAGAAGAT AGAGATT C C C CT GGAC CAC AC CT C C GT GGT T CA 1560 

M I M I II I I M II I I I I I I I M II I I II I M I I I II II I I I I II I M II M I I I M M i 

Db 1426 AAGT GAGGAGGC CCAT G GG CAGAAGAT AGAGATT C C C CTGGAC CACAC CT CC GT GGT T CA 14 85 

Qy 1561 CTTT GGTCACAAGTAGGAGACACAGAT GGCACCT GT GGCCAGAGCACCTCAGGACCCT CC 1620 

M II I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I II I I M I I M I M I I I II 
Db 148 6 CTTT GGTCACAAGTAGGAGACACAGAT GGCAC CTGT GGCCAGAGCACCTCAGGACCCTCC 1545 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I II I I I I II I II I I I I I I I I I I I I I I I II I I I II I M I I I I I II I M I I II II II I I I II 

Db 154 6 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1605 

Qy 1681 G GACT GT AC CT GT AG GAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGC GGGAAT ACT 17 4 0 

I II I II M I I I II I I I I II I I I II I I II I I M I I I I I I II I I I II I I I I II I I I I I I I I I 
Db 1606 G GACT GT ACCT GT AG GAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGC GGGAAT ACT 1665 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

I I I I I I II I I I I II I I I II I I I I I I I I I I I I I I I I I M II I I I I I I M I 11 I I I I I | I I I 
Db 1666 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1725 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I II I M I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I I I I I 

Db 1726 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1785 


Qy 1861 GT ACT G GCAT CACAC GCAGGT T AC CT T GG C GT GT GT C CCT GT GGT AC C CT GGCAGAGAAG 1920 


Db 


I I I I I I I ( I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I i I [ I I I I I I I I I I I I I I I I 

17 86 GTACT G GCAT CACAC GCAG GT T AC CTT GGC GT GT GT C C CT GT GGT AC C CT GGCAGAGAAG 1845 


Db 


QY 


1921 AGAC CAAGCTT GTT T C C CT GCT GG C CAAAGT CAGTAGGAGAG GAT GCACAGT T T GCT ATT 198 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I 

1846 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1905 


QY 


Db 


1981 T GCTT T AGAGACAG GGACT GT AT AAAC AAGC CT AACATT GGT GCAAAGAT T GC CT CT T GA 2 04 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I I I I I I 
19 06 T GCT T T AGAGAC AGGGACT GT AT AAACAAGC CTAACATT G GT GCAAAGATT GC CT CT T GA 1965 


QY 


2041 ATTAAAAAAAAA 2052 


Db 


1966 AAAAAAAAAAAA 1977 
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Query Match 89.1%; Score 1843.4; DB 4; Length 2370; 

Best Local Similarity 98.8%; Pred. No. 0; 

Matches 1867; Conservative 0; Mismatches 21; Indels 1; Gaps 1; 

Qy 172 GGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTG 231 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I M I I I I I I I I I I > I M I I I M 
D b 1 GGCAGCTTT GT GGAGAT GGT GGAC AACCT GAGGGGCAAGT CGGGGCAGGGCT ACT ACGT G 60 

Qy 232 GAGAT GACC GT GG G C AG C C C C C C GC AGACGCT CAAC AT C C T GGT GGAT AC AG GC AGC AGT 2 91 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I 
Db 61 GAGAT GACC GT GGGC AGC C C C C C GCAGACGCT CAAC AT C CT G GT GGATAC AGG C AGCAGT 120 

Qy 292 AACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTG 351 

I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 121 AACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTG 180 

Qy 352 TCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCT^AGTGG 411 

I I I I I I I I I I I I I I I I I I I I I I I I I- I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 181 T C CAGCAC AT AC C G GGAC CT C C GGAAGGGT GT GT AT GAGC C CT ACACC C AG GGCAAGT GG 2 40 

Qy 412 GAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGT 471 

I I I I I 1 I I I I I I I I I I I I I I I I I I M I I M I I II II I I I I II I I I I I I I I I I I I I I I I I I 

Db 241 GAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGT 300 

Qy 472 GCCAACATT GCT GCCAT CACT GAATCAGACAAGTT CTT CAT CAACGGCT CCAACT GGGAA 531 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I 
Db 301 GCCAACATT GCT GCCAT CACT GAATCAGACAAGTT CTT CAT CAACGGCT CCAACT GGGAA 360 

Q y 532 GGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTC 591 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i M I I M I M I I 
Db 361 GGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTC 420 

Qy 592 TTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGT 651 

| | | 1 | | | | | I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I I I I 

Db 421 TTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGT 480 

Qy 652 GCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATT 711 

I I I I M I I II I I I I I I I I M I I I I I I I I I I I I M I I I I I M I M I I I I I I I I I I I I I I I I 
Db 481 GCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATT 540 

Qy 712 GGAGGT AT C GAC C ACT CGCT GT ACAC AG GC AGT CT CT GGTAT AC AC C CAT C C GGC GGGAG 771 

I I I I I I M I II I I I II I I I I II I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I 

Db 541 G GAGGT AT C GAC CACT CGCT GT ACAC AGGCAGT CT CT GGT AT AC AC C CAT C C GG C GG GAG 600 

Qy 772 T GGTAT TAT GAGGT CAT CATT GT GC GGGT GGAGAT CAAT GGACAG GAT CT GAAAAT G GAC 831 

I || | I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 601 T GGTAT TAT GAG GT GAT CAT T GT GC GG GT GGAGAT CAAT GGACAGGAT CT GAAAAT GGAC 660 

Qy 832 T GCAAGGAGTACAACTATGACAAGAGCATT GT GGACAGTGGCACCACCAAC CTT CGTTTG 8 91 

| | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I I II I 
Db 661 T GCAAGGAGT ACAACT AT GACAAGAG CAT T GT GGACAGT GGC AC C AC CAAC CT T C GTTT G 720 


Qy 

892 

Db 

721 

Qy 

952 

Db 

781 

Qy 

1012 

Db 

841 

Qy 

1072 

Db 

901 

Qy 

1132 

Db 

961 

Qy 

1192 

Db 

1021 

Qy 

1252 

Db 

1081 

Qy 

1312 

Db 

1141 

Qy 

1372 

Db 

1201 

Qy 

1432 

Db 

1261 

Qy 

1492 

Db 

1321 

Qy 

1551 

Db 

1381 

Qy 

1611 

Db 

1441 

Qy 

1671 

Db 

1501 


C C CAAGAAAGT GT T T GAAGC T GCAGT CAAAT C CAT CAAGGCAG C CT C C T CCAC GGAGAAG 951 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I i I I I I i I I I f I I I I I I I M I I I I I 

C C CAAGAAAGT GTT T GAAGCT GCAGT CAAAT C CAT CAAGG C AGC CT CT C CAC G GGAGAAG 780 

TTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCT 1011 

I I I I I I I I I I I I I I I I II I I I II I I I I I I I M I I I I I I I I I I I I I I I I II I I I I M I I I I 

TTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCT 840 

T GGAACAT T T T C C CAGT CAT CT CACT CT AC CT AAT GGGT GAGGT T ACC AAC C AGT C CT T C 1071 

I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M I I I I 

T GGAACAT T T T C C CAGT CAT CT CACT CT AC CT AAT GGGT GAGGTT AC CAAC C AGT C CTT C 900 

C GCAT CAC CAT C CTTC C GCAGCAAT ACCT GC G GC CAGT GGAAGAT GT GGC CAC GT C C CAA 1131 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

C GCAT CAC CAT C CTT C C G CAGCAATAC C T G C G GC CAGT GGAAGAT GT GGC C AC GT C C CAA 960 

GACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTT 1191 

| I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GAC G ACT GT T ACAAGT T T GC C AT CT CAC AGT CAT C CAC GG G CACT GT TAT G GGAGCT GT T 1020 

ATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTC 1251 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

ATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTC 1080 

AGC G CT T GC C AT GT GCAC GAT GAGTT C AG GAC GGCAGC GGT GGAAG GCC CTT T T GT CACC 1311 
| I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACC 114 0 

TTGGACATGGAAGACTGTGGCTAC7^CATTCCACAGACAGATGAGTCAACCCTCATGACC 1371 
I I i I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
TT G GAC AT GGAAGACT GT GG CT ACAAC AT T C C AC AGACAGAT GAGT CAAC C CT CAT GAC C 1200 

ATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTG 1431 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

ATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTG 1260 

TGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATC 1491 

I | | | || I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I 

TGT CAGT GGCGCTGCCTCCGCTGCCTGCGCCAGACAATGGATGACTTTGCT GAT GACATC 1320 

T CCCT GCT GAAGT GAGGAGGCCCAT GGGCAGAAGAT AGAGATT CCCCT - GGACCACACCT 1550 

I I I I I I I I I I I I I I I I I M II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

T CCCTGCTGAAGT GAGGAGGCCCAT GGGAGAAAGAT AGAGATT CCC CT GGGACCACACCT 1380 

CCGTGGTTCACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTC 1610 

I I I M I I I II I I I I I I II I I I II I I I I I I I I I I II I I I I I M 

C CGT GGTT CACT TT GGT CACAAGTAGGAGACACAGATGGCACCTGT GGC CAGAGCACCT C 1440 

AG GAC C CT CC C CAC C CAC CAAAT GC CT CT GC CTT GAT GGAGAAGGAAAAGGCT GGCAAGG 1670 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I 

AGGAC C CTC C C CAC C CAC CAAAT G C CT C T GC CTT GAT GGAGAAGGAAAAGGCT GGCAAGG 1500 

T GGGTT CCAGGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACT CTGCTGG 1730 

| | | | | | | I I I II I I I I I I I I I I I II I I I I I I II I I I M I I I I I I I M I I I I I I I I I I I I I 

T GGGT T C CAG GGACT GT ACCT GTAGGAAACAGAAAAGAGAAGAAAGAAG CACT CT GCT GG 1560 


Qy 1731 CGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAG 17 90 

M I I II I I I I I I I I I I I I I I I I I I I M I I I!' I I I I M M I II I I I I I I I I M I I I I I I I I 
Db 1561 CGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAG 162 0 

Qy 17 91 CCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTT 1850 

I | | | | | I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 CCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTT 168 0 

Qy 1851 AGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCT 1910 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I r I I I I I I I I I M I I I I I I I I I I 
Db 1681 AGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCG 174 0 

Qy 1911 GGCAGAGAAGAGAC CAAGCT T GT T T CCCT GCT GGC CAAAGT CAGT AGGAGAGGAT GCACA 197 0 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 17 41 GGCAGAGAAGAGAC CAAGCT T GTTTCCCTGCT GGC CAAAGTCAGT AGGAGAGGAT GCACA 18 00 

Qy 1971 GT T T GCT AT TT GCT T T AGAGAC AGGGACT GT AT AAACAAGCCTAACAT T GGT GCAAAGAT 2 030 

| | | | | | | | | | | | | | I I II II II II I I I I I I II I I I I II ! I I I I I I I M I I I I I I I I I I I I 
Db 18 01 GTT T GCT AT TT GCT T T AGAGACAGGGACT GT AT AAACAAGCCTAACAT T GGT G CAAAGAT 18 60 

Qy 2031 TGCCTCTTGAATTAAAAAAAAAAAAAAAA 2059 

I I I I I I I I I I I M I I I I I I I I I I I I I 

Db 18 61 TGCCTCTT GAAT T AAAAAAAAAAACT AGA 18 89 
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1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I III II I I I I I I I III I I I I I M I I I I I I I I I I I 

1 ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGCTCGGGAATGCTGCCTGCCCAG 60 

61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I H I I I 

61 GGAACCCATCTCGGCATCCGGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 12 0 

121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

Ml I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I II I I I I I I I I I I I I I I 

121 CTGAGGCTGCCCCGGGAGACTGACGAGGAATCGGAGGAGCCTGGCCGGAGAGGCAGCTTT 18 0 

181 GT GGAGAT G GT GGACAACCT GAGGGGCAAGT C GGG GC AGGGCT ACT AC GT GGAGAT GAC C 24 0 

I I I I I I I I I I I I I I I I I I I II II I Mill I I I I I I I I I I I I 

181 GT GGAGAT GGT GGACAACCT GAGGGGAAAGT CC GGC CAG GGC TACT AT GT G GAGAT GAC C 24 0 

241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

II I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I M I I I I I I I I I I I I 

241 GTAGGCAGCCCCCCACAGACGCTCAACATCCTGGTGGACACGGGCAGTAGTAACTTTGCA 300 

301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

301 GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

II || I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I M I II I I I I I I M I 

361 T AT CGAGACCT CCGAAAGGGT GT GT AT GT GC C CT AC AC C CAGGGCAAGT G G GAGGGGGAA 42 0 

421 CT GGGCACCGAC CT GGT AAGC AT CC C C CAT GGCC C CAAC GT CAC T GT G C GT GC CAAC AT T 4 80 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M II 
421 CTGGGCACC GACCTG GT GAGC AT C C CT CAT GGC C C CAAC GT C ACT GT GC GT GC CAACAT T 480 

481 GCT GCCAT CACT GAAT CAGACAAGT T CT T CAT CAAC GGCT C CAACT GG GAAGGC AT C C T G 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I 
481 GCT GCCAT CACT GAAT CGGACAAGTT CTT CAT CAAT GGT T C CAACT GGGAGGGCAT CCT A 54 0 

541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I | | | I | I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I M I I I I I M I I II 

541 GGGCTGGCCTATGCTGAGATTGCCAGGCCCGACGACTCTTTGGAGCCCTTCTTTGACTCC 600 

601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I II Mill I M I I I I I I 
601 CTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTC 660 

661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I | | | | | | | M II I II I I I M I I I I II I II I II II I I M I I I I II I I I II I II 

661 CCCCTCAACCAGACCGAGGCACTGGCCTCGGTGGGAGGGAGCATGATCATTGGTGGTATC 72 0 

721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 80 

I I I I I I I I I M I I I I I I I I II II I I I I I I I I II I M I I II I I II I II I I I II I I I I I 

721 GACCACTCGCTATACACGGGCAGTCTCTGGTACACACCCATCCGGCGGGAGTGGTATTAT 7 80 

781 GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 84 0 

II M I I I I I I I I II I II I I I I I I I II I M II I I I M II I I I I I I I II I I II 

781 GAAGTGAT CATT GT ACGT GT GGAAAT CAAT GGT CAAGATCT CAAGAT GGACT GCAAGGAG 840 

841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 900 


Db 

841 

Qy 

901 

Db 

901 

Qy 

961 

Db 

961 

Qy 

1021 

Db 

1021 

Qy 

1081 

Db 

1081 

Qy 

1141 

Db 

1141 

Qy 

1201 

Db 

1201 

Qy 

1261 

Db 

1261 

Qy 

1321 

Db 

1321 

Qy 

1381 

Db 

1381 

Qy 

1441 

Db 

1441 

Qy 

1501 

Db 

1501 

Qy 

1561 

Db 

1561 

Qy 

1619 

Db 

1621 

Qy 

1679 


1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 11 1 1 II II 1 1 1 1 1 1 1 

T ACAACT AC GACAAGAGCAT T GT GGACAGT GGGAC C AC CAAC CT T CGCTTGCC CAAGAAA 900 

GT GT T T GAAGCT GCAGT CAAAT C CAT CAAGGC AGC CT C CT C C AC GGAGAAGT T C C CT GAT 960 

II I I I I I I II I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I M I I I I M I I I 

GTATTTGAAGCTGCCGTCAAGTCCATCAAGGCAGCCTCCTCGACGGAGAAGTTCCCGGAT 960 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

|| || I I I I I I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

GGCTTTTGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAACATT 1020 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

I I I I I I I I I I I MMI I I I I I I I I I I I I I II I I I I I M I I M II II II II II II 

T T C C C AGT C ATT T CACT T T AC CT CATGG GT GAAGT C AC CAAT C AGT C CT T C C GCATCACC 1080 

AT C CT T C C GC AGCAAT ACCT GC GGC CAGT G GAAGAT GT GGC C AC GT C C CAAGACGACT GT 1140 

I || | I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I 

AT C CT T C CT CAGCAAT AC CT ACGGCC GGT GGAGGAC GT G GC C AC GT C C CAAGACGACT GT 1140 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAG 1200 
| I I I M I I II I II M I I I M I I I I I I I I I I I I I I M I I I M M I I I I I I I I I I I 
TACAAGTT C GCT GT CT CACAGTCATCCACGGGCACT GTTAT GGGAGCCGT CATCATGGAA 12 00 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

II | | I I I II Mill Mill II M II I I I I I I M I II M II I II II II I I I II II 

GGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

CAT GT GCAC GAT GAGT T CAGGAC GGC AGC GGT GGAAGGC C CT T T T GT C ACCTT GGACAT G 1320 

I I I II M I II II I II II II II I I I I II M II I II IMIM 

CAT GT G C AC GAT GAGTT CAGGAC GGC GG CAGT GGAAGGT C C GT T T GT T ACGGCAGACAT G 132 0 

GAAGACT GT GGCT ACAACAT T C C ACAG AC AGAT GAGT CAAC C CT CAT GACCAT AGC CT AT 1380 

I | I M I M I I I I I I I I I I I I I M I I I I I I I I I II I I I I IE II I I I I I I I I I I I I I I I 

GAAGACT GT GGCTACAACATT C C C CAGAC AGAT GAGT CAACACTT AT GACCAT AGC CT AT 138 0 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I I I I I I I I I I I I I I I M I M M I I I I I I I I I I I I I I I I I M I M I I M I I M I 

GTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAGTGG 14 40 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

| I I I I I I I II I I II I II II I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I II I I 

CGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTGCTC 1500 

AAGT GAGGAGGCCCAT GGGCAGAAGATAGAGATT CCCCT GGACCACACCT CCGTGGTT CA 1560 

INI M I II II I I II I II I I III MM II I I I M II II I I II II II II I 

AAGTAAGGAGGCTCGTGGGCAGATGATGGAGACGCCCCTGGACCACATCTGGGTGGTTCC 1560 
CT TT GGT CACA- - AGT AG GAGACACAGAT GGC AC CT GT GGCCAGAGC AC CT CAGGAC C CT 1618 

I II II II I I II I I I MM I Mill M II II II I II II I I II II II I I M II I 

CT T T GGT CACAT GAGT T GGAGCT AT G GAT GGT AC CT GT GGCCAGAGCAC CT CAGGAC C CT 162 0 
CCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCC 167 8 

I I II I I I I M I I I II I I I II Ml MM I II II I I I II I I I 

CACCAACCTGCCAATGCTTCTGGCGTGACAGA-ACAGAGAAATCAGGCAAGCTGGATTAC 167 9 
AGGGACT GT AC C T GT AGGAAAC AGAAAAGAGAAGAAAGAAGCACT C T GCT GGC GGGAAT A 17 38 

M II I I II II I M I II II M I II II I I II I I II II I I II II I I M II 


Db 

1680 

AGGGCTTGCACCTGTAGGACACAGGAGAGGGAAGGAAGCAGCGTTCTGGTGGCAGGAATA 

1739 

Qy 

1739 

CTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 

iii i i i i i iiii ii iii i i i i i i i i i t I I I l I l l l l l 1 l 1 1 1 1 1 1 II 1 
Ml 1 III! IIII II III 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIMUM 1 

T C CT TAGGCAC C ACAAACT T GAGT - T GGAAAT TTTGCTGCTT GAAG CT T CAGCC CT GAC C 

1798 

Db 

1740 

1798 

Qy 

Db 

1799 
1799 

CTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAG 

ii ii iii it i i i t i i i i i i i i i i i i i i 1 1 1 1 1 1 1 1 1 1 1 II II III 
|| || Ml II 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 II 1 M 1 1 1 1 II M M > 

CTCTGCCCAGCA-TCCTTTAGAGTCTCCAACCTAAAGTATTCTTTATGTC CTTCCAG 

1858 
1854 

Qy 

1859 

AAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGA 
iii ii i i i i iiii iii i i l l l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 I 

| I I I 1 1 1 1 1 1 Ml II IIII IIII Ml II 1 M M II 1 1 11 II 1 1 II 1 1 M 1 1 M 

AAGTACTGGCGTCATACTCAGGCTACC-CGGCATGTGTCCCTGTGGTACCCTGGCAGAGA 

1918 

Db 

1855 

1913 

Qy 

1919 

AGAGAC CAAGCT T GTT TCCCTGCT GGCCAAAGT C AGT AGGAGAGGAT GC ACAGT T T GCT A 

l 1 IIII 1 1 1 I l 1 1 M I I I I I I I 1 1 1 1 1 I 1 II III 1 II II 1 II 1 1 1 
I I IIII 1 1 1 1 M M II 1 1 M 1 1 1 1 1 1 M 1 M Ml 1 II i i i i i i i i 

AAGGGCCAATC-TCATTCCCTGCTGGCCAAAGTCAGCAGAAGTyVGGTG— AAGTTTGCCA 

1978 

Db 

1914 

1970 

Qy 

1979 

TTTGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTT 

MM II M 1 1 1 1 II 1 1 M 1 II M Ml 1 1 II 1 II 1 II III MM 

GTTGCTTTAGTGATAGGGACTGCAGACTCAAGCCT-ACACTGGTACAAAGACTGCGTCTT 

2038 

Db 

1971 

2029 

Qy 

2039 

GAATTAAAAAAAAA 2 052 

II II 1 1 1 1 1 1 

GAGATAAACAAGAA 2 043 


Db 

2030 
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Qy 

1 

Db 

1 

Qy 

61 

Db 

61 

Qy 

121 

Db 

121 

Qy 

181 

Db 

181 

Qy 

241 

Db 

241 

Qy 

301 

Db 

301 

Qy 

361 

Db 

361 

Qy 

421 

Db 

421 

Qy 

481 

Db 

481 

Qy 

541 

Db 

541 

Qy 

601 

Db 

601 

Qy 

661 

Db 

661 

Qy 

721 

Db 

721 

Qy 

781 

Db 

781 


ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I III I I I I I I II I I I I I I I I III I I I I I I I I I I I I I I I I M I I I I I 

ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGCTCGGGAATGCTGCCTGCCCAG 60 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

GGAACCCATCTCGGCATCCGGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 12 0 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Ml I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

CTGAGGCTGCCCCGGGAGACTGACGAGGAATCGGAGGAGCCTGGCCGGAGAGGCAGCTTT 18 0 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 2 40 

I I M I I I Ml II I I I II I I I I I I I I I I I I I I I I I I 

GT GGAGAT GGT GGACAAC C TGAGGGGAAAGT C C G GC CAGGGCT ACT AT GT G GAGAT GAC C 24 0 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

|| I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I II I I I I I I I I I I I I I I I I I 

GTAGGCAGCCCCCCACAGACGCTCAACATCCTGGTGGACACGGGCAGTAGTAACTTTGCA 300 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 3 60 

| I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I i I I I I I I I I I I I I I I I I I 

GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

TACCGGGACCTCCGGT^AGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 
|| || I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TAT C GAGAC CT CC GAAAGGGT GT GT AT GT GC C CT AC AC C C AGG GCAAGT G G GAGGG GGAA 420 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 8 0 

M | | I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I I I I I M 

CTGGGCACCGACCTGGTGAGCATCCCTCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

GCTGCCATCACTGAATCAGAC7U\GTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 540 

I | | | | || | I I I I I I I I I I I I II I I I I I M I I I I I M I I I I I I I I M I I I I I I I I I 

GCT GCCAT CACT GAAT CGGACAAGTT CTT CAT CAAT GGTT CCAACTGGGAGGGCAT CCTA 54 0 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I [ M | | | | M I I I I I I M I I I II II I I I I I I I I I I I I I I I I I I I I I I II II I I I I 

GGGCTGGCCTATGCTGAGATTGCCAGGCCCGACGACTCTTTGGAGCCCTTCTTTGACTCC 600 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I M I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

CTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTC 660 

C C C CT C AAC CAGT C T GAAGT GCT GGCCTCTGTC GGAG G GAGC AT GAT CATT GGAGGT AT C 72 0 

I I I I I I I I I I I I I II I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I 

C C CCT CAAC CAGACC GAGG C ACT GG C CT C GGT GG GAGGGAG CAT GAT CAT T G GT GGT AT C 720 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 8 0 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

GACCACTCGCTATACACGGGCAGTCTCTGGTACACACCCATCCGGCGGGAGTGGTATTAT 7 8 0 

GAGGT CAT CAT T GT GC G GGT GGAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAG GAG 8 4 0 

II II I II I I I I I II I I I I I MINIM II I I I I I II I I I I I I I I I I I I I I I 
GAAGT GAT CAT T GT AC GT GT GGAAAT CAAT GGT CAAGAT CT CAAGAT GGACT GCAAGGAG 84 0 


Qy 

841 

Db 

841 

Qy 

901 

Db 

901 

Qy 

961 

Db 

961 

Qy 

1021 

Db 

1021 

Qy 

1081 

Db 

1081 

Qy 

1141 

Db 

1141 

Qy 

1201 

Db 

1201 

Qy 

1261 

Db 

1261 

Qy 

1321 

Db 

1321 

Qy 

1381 

Db 

1381 

Qy 

1441 

Db 

1441 

Qy 

1501 

Db 

1501 

Qy 

1561 

Db 

1561 

Qy 

1619 

Db 

1621 


T ACAACT AT GACAAGAGCAT T GT GGAC AGT G GC AC C AC CAAC CT T C GT TT G C C CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I 

T ACAACT AC GACAAGAGCATT GT GGAC AGT GGGAC CAC CAAC CT T CGCTTGCC CAAGAAA 900 

GTGTTTGAAGCTGCAGTCTWVTCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

GT AT T T GAAGCT GC C GT CAAGT CCAT CAAG GC AGC CT C CT C GAC GGAGAAGT T C C C GGAT 960 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

II || I I I I II I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I M I I 

GGCTTTTGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAACATT 1020 

T T C C C AGT CAT CT C ACT CT ACCTAAT GGGT GAGGT T AC CAAC C AGT C CT T C C GCAT CACC 1080 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
T T C C C AGT CAT T T CACT T T AC CT CAT GGGT GAAGT CAC CAAT CAGT C CT T C C GC AT CACC 1080 

AT C CT T C C GCAGCAAT AC CT GCGGCC AGT GGAAGAT GT GGC CAC GT C C CAAGAC GACT GT 1140 

I I I II I I I I I I I I MM I M I M I I I M I M II MM 

AT C C TT C CT CAGCAAT AC CT ACG GCCGGT GGAGGAC GT GGC CAC GT C C CAAGAC GACT GT 1140 
TACAAGTTT GCCAT CT CACAGTCATCCACGGGCACT GTTAT GGGAGCT GTT ATCATGGAG 1200 

M M I I II II II I I I I M M M II M II I I II I I M I I M I I I I II I II II M I 

T ACAAGTT CGCT GT CT CACAGTCATCCACGGGCACT GTTAT GGGAGC CGT CAT CAT GGAA 1200 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I | Mill II I II I I Mill I I I M II I II I I II I M M I M I II II II I I M II 

GGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

CAT GT G CAC GAT GAGT T CAGGAC GG CAGCGGT G GAAGGC C CT TT T GT C AC CT T GGACAT G 1320 

M I I I I M I II I I I I I I I M I II II I M II M II II II I I I I I II II M II 

CAT GT GC AC GAT GAGT T CAG GAC GGCGGCAGT GGAAG GT C C GT T T GT T AC GGC AGACAT G 1320 
GAAGAC T GT G GCT ACAACAT T C CACAG ACAGAT GAGT CAAC C CT CAT GAC CAT AGC CT AT 1380 

M I M I II II I I I I I I I I M M I II I II II II I I M I II I II II II I I II i II I M I 

GAAGACT GT GG CT ACAACAT T C C C C AGACAGAT GAGT CAAC ACT TAT GAC CAT AGC CT AT 1380 
GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

M I I II M II II I I I I II I I I M II M II II I I II II I M II I M I M I M II I II I 

GTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAGTGG 1440 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

| I I I I I II || || I | I M II II I I M I II II II II I II I M II II II I I M I I II I 

CGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTGCTC 1500 

AAGT GAGGAGGC C CAT GGGCAGAAGAT AGAGATT CCCCT GGAC C ACAC CT C C GT GGT T C A 1560 

MM II I II M I I I I I II II Ml I II I II II M II II II I II I I I I II I 

AAGTAAGGAGGCTCGTGGGCAGATGATGGAGACGCCCCTGGACCACATCTGGGTGGTTCC 15 60 
CTTT GGT C ACA- - AGT AGGAGACAC AG AT G G C AC CT GTGGC C AGAGCAC C T CAGGAC C CT 1618 

I II M I I II II III MM I I II I I II I I II I I II I II I I I II II I M I M II 

CT TT GGT CAC AT GAGT T GGAGCT AT GGAT GGT AC CT GT GGC CAGAGCAC CT CAGGAC C CT 1620 
CCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCC 167 8 

I II I II I II II I I I I I I II I II I I I II I II II I I I I I M I 

CAC CAACCT GC CAAT GCT T CT GGC GT G AC AGA- AC AGAGAAATCAGG CAAGCT GGAT T AC 1679 


Qy 1679 AGG GAC T GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGC GGGAAT A 1738 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I 
Db 1680 AGGGCTTGCACCTGTAGGACACAGGAGAGGGAAGGAAGCAGCGTTCTGGTGGCAGGAATA 1739 

Qy 1739 CTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 1798 

Ml | | I I I M I I I I III I i I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 17 4 0 TCCTTAGGCACCACAAACTTGAGT-TGGAAATTTTGCTGCTTGAAGCTTCAGCCCTGACC 17 98 

Qy 17 99 CTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAG 1858 

I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I III I I I I I 

Db 1799 CTCTGCCCAGCA-TCCTTTAGAGTCTCCAACCTAAAGTATTCTTTATGTC CTTCCAG 1854 

Qy 18 5 9 7VAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGA 1918 

I I I I I I I I I I Ml II I I I I I I I I III I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 1855 AAGTACTGGCGTCATACTCAGGCTACC-CGGCATGTGTCCCTGTGGTACCCTGGCAGAGA 1913 

Qy 1919 AGAGAC CAAGCT T GT T T C C CT GCT GGC CAAAGT CAGT AGGAGAGGAT GC AC AGT T T GCT A 197 8 

I | I I I I I I M I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I 

Db 1914 AAGGGCCAATC-TCATTCCCTGCTGGCCAAAGTCAGCAGAAGAAGGTG— AAGTTTGCCA 1970 

Qy 1979 T T T GCT TT AGAGACAGGGACT GT AT AAACAAGCCT AACATTGGT GCAAAGAT TGC CT CT T 2038 

MINIM! II II I I I I I I I I II I I I I I III I I I I I I I I II III I I I I 

Db 1971 GTTGCTTTAGTGATAGGGACTGCAGACTCAAGCCT-ACACTGGTACAAAGACTGCGTCTT 2 02 9 

Qy 2039 GAATTAAAAAAAAA 2052 

II I I I I I I I I 

Db 2030 GAGATAAACAAGAA 2043 


RESULT 12 
US-09-551-853D-7 

; Sequence 7 , Application US/09551853D 

; Patent No. 6500667 

; GENERAL INFORMATION: 

; APPLICANT : GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER 1 S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 

FILE REFERENCE: 29915/6280L 
; CURRENT APPLICATION NUMBER: US/ 09/ 551 , 853D 
; CURRENT FILING DATE: 2000-04-18 

PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/20881 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 7 3 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 7 

LENGTH : 2 04 3 
; TYPE: DNA 
; ORGANISM: Mus musculus 
US-09-551-853D-7 


Query Match 75.0%; Score 1552.4; DB 4; Length 2043; 

Best Local Similarity 87.6%; Pred. No. 0; 

Matches 1800; Conservative 0; Mismatches 241; Indels 13; Gaps 


Qy ' 

i 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 

60 


t i i i i i i iii i i i i i i i i i i i i i i i i tii i t i i i i i i i i i i i i i i i i i i I l l 
I II 1 1 1 1 III 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 III (MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

l 

ATGGCCCCAGCGCTGCACTGGCTCCTGCTATGGGTGGGCTCGGGAATGCTGCCTGCCCAG 

60 

Qy 

61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 

120 


ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ii i i i i i i i i i 
II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 M 


Db 

61 

GGAACCCATCTCGGCATCCGGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 

120 

Qy 

121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 

180 


i i i i i i i i i i i i i i i i i ii i i i i i i i i i i i i i i i i i i i i i i i i i i 

I I I | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 


Db 

121 

CTGAGGCTGCCCCGGGAGACTGACGAGGAATCGGAGGAGCCTGGCCGGAGAGGCAGCTTT 

180 

Qy 

181 

GT GGAGAT GGT GGACAACCT GAGGGGCAAGT CGGGGCAGGGCTACT ACGT GGAGAT GACC 

240 


■ > > i i * i i i i i i i i i i i i i i i i i i i i i i i > i ii i i i i i i i i i i i i i i i i i i i i i i i 
I | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

181 

GT GGAGAT GGT GGACAAC CT GAGG GGAAAGT CC GGCCAGGGCT ACT AT GT GGAGAT GAC C 

240 

Qy 

241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 

300 


ii i i i i i t i i i i i i i i i i i i i i i t i i i i i i < i i i i i ii i i i i i i i i i i i i i i i i i 
II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

241 

GTAGGCAGCCCCCCACAGACGCTCAACATCCTGGTGGACACGGGCAGTAGTAACTTTGCA 

300 

Qy 

301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 

360 


.■I.. tii i t i i i i i i i i i i i i i i i i i i i i i i i t t t i i t i i i i i i i \ i i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 


Db 

301 

GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 

360 

Qy 

361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 

420 


■ I ii i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

II II MINIM 1 1 1 1 1 1 M 1 1 1 1 II 1 II II M 1 1 1 1 1 II 1 1 1 II 1 1 M 1 1 1 1 1 1 


Db 

361 

TATCGAGACCTCCGAAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAGGGGGAA 

420 

Qy 

421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 

480 


i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i 
1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 M 1 1 M II 1 1 M 1 1 1 1 M 1 1 1 II II 1 


Db 

421 

CTGGGCACCGACCTGGTGAGCATCCCTCATGGCCCCAACGTCACTGTGCGTGCCAACATT 

480 

Qy 

481 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 

540 


i i i i i i i i i i i i i i i i i i i i i I i I I I I I I l I I I 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 I I 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 M II M 1 1 1 1 1 M II 1 M M 1 I 1 1 1 1 II 1 1 1 1 1 1 


Db 

481 

GCTGCCATCACTGAATCGGACAAGTTCTTCATCAATGGTTCCAACTGGGAGGGCATCCTA 

540 

Qy 

541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 

600 


i i i i i i i i t i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i I I I i i I I 
1 1 II II 1 M 1 1 1 1 1 1 II 1 1 1 II 1 I II II 1 M I M 1 1 1 II M 1 M 1 II M 1 1 II II 


Db 

541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCCGACGACTCTTTGGAGCCCTTCTTTGACTCC 

600 

Qy 

601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 

660 


i i i i i i i i i i i i i i i i i i t i i \ i i i i i t i i i i i i i i i ii I I I I l t l l l l l l l 1 
Mill M II 1 1 1 1 1 1 1 1 1 II 1 1 II 1 Mil I 1 I 1 II II M Mill II 1 II II II 


Db 

601 

CTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTC 

660 

Qy 

661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 

720 


1 II 1 1 1 M 1 1 II 1 II 1 II 1 1 II 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 II 1 1 MINI 


Db 

661 

C C C CT CAAC CAGAC C GAGGC ACT GGC CT CG GT GGGAGGGAGC AT GAT CAT T GGT GGT AT C 

720 

Qy 

721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 

780 


1 I I I I II 1 II 1 Mill 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 


Db 

721 

GAC C ACT C GCT AT ACAC GGGC AGT CT CT GGT AC ACACC C AT C C GGC GGG AGT GGT AT TAT 

780 

Qy 

781 

GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT GGACAGGAT CT GAAAAT G GAC T GCAAGGAG 

840 


Db 

781 

Qy 

841 

Db 

841 

Qy 

901 

Db 

901 

QY 

961 

Db 

961 

Qy 

1021 

Db 

1021 

Qy 

1081 

Db 

1081 

Qy 

1141 

Db 

1141 

Qy 

1201 

Db 

1201 

Qy 

1261 

Db 

1261 

Qy 

1321 

Db 

1321 

Qy 

1381 

Db 

1381 

Qy 

1441 

Db 

1441 

Qy 

1501 

Db 

1501 

Qy 

1561 

Db 

1561 

Qy 

1619 


M II I I I I I II I II I I I I I MIIMM II I I I II II I I I I I I I I I II I II I 
GAAGT GAT CATTGT ACGT GTGGAAAT CAAT GGTCAAGATCTCAAGATGGACT GCAAGGAG 840 

T ACAACT AT GACAAGAGC ATT GT G GAC AGT GGCACC AC CAAC CTT CGTT T G C C CAAGAAA 900 

III I I I I I I I I I I I I I I I I I I I I I I I I I I Ill I I I I I I I I I I I I 

T ACAACT AC GACAAGAGCAT T GTGGAC AGT GG GAC C AC CAAC CT T C GC TT GC C CAAGAAA 9 00 

GT GT T T GAAGCT GC AGT CAAAT C CAT CAAGGCAGCCT C CT C C AC GGAGAAGT T C C CT GAT 960 

II I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

GT AT TT GAAG CT GC C GT CAAGT CCAT CAAGGCAGCC T C CT C GAC GGAGAAGTT C CC GGAT 960 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

|| || I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

GGCTTTTGGCTAGGGGAGCAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAACATT 1020 

T T C C C AGT CAT CT C ACT CT AC CT AAT G GGT GAGGTT AC CAAC CAGT CCT T C C GCAT CACC 108 0 

I I I I I I I I I I I Mill I I I I I III II I I I I I I I I I I I I I I I I I I 

T T CC CAGT CATTT C ACT T T AC C TC AT G G GT GAAGT C AC CAAT CAGT C CT T C C GCAT CACC 1080 

AT C CT T C C GC AGCAAT AC CT GC GGC CAGT GGAAGAT GT GGC C AC GT C CC AAGAC GACT GT 1140 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I M E I I I I I I I I I I I I I I I II I I I 

AT C CT T C CT CAGCAAT AC CT AC GGC C G GT GGAGGACGT GGC C AC GT C CCAAGAC GACT GT 114 0 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAG 12 00 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I II I I I II I M I I I I I I I I 
TACAAGTT CGCTGT CT CACAGT CATCCACGGGCACTGTT AT GGGAGC CGT CAT CAT GGAA 1200 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

II I I I I I M Mill II I I I I I I II I I I I I I I I I I I II I I I I M I I I I I I I I I I I 

GGTTTCTATGTCGTCTTCGATCGAGCCCGAAAGCGAATTGGCTTTGCTGTCAGCGCTTGC 1260 
CAT GT GC AC GAT GAGT T C AGGAC G GC AGC G GT G GAAGGC C CT T T T GT C ACCT T GGAC AT G 1320 

I I I I I I I I II II M I I I I I I I I I M I II I I I I I I I I II Mill II I I I I I I 

CAT GT GC AC GAT GAGT T C AGGAC GGC G GC AGT GGAAGGT CC GT T T GT T ACGGCAGAC AT G 1320 
GAAGACT GT GGCT ACAAC ATT C CAC AGAC AGAT GAGT CAAC CCT CAT GACC AT AG C CT AT 1380 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I II I M I I I II I I M I I I 

GAAGACT GT GGCTACAACATT CCCCAGACAGAT GAGTCAACACTT AT GACCATAGCCT AT 138 0 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

I I I I I I I I III I II I I II I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I II II I 

GTCATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAGTGG 144 0 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I M I M I I I I I I I I I I I Mill I I I II I I I I I I I II I I M II II II I I M I 

CGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTGCTC 1500 

AAGT GAG GAG GC C CAT GGG CAGAAGAT AGAGAT T C C C CT GGAC CACAC CT CC GT GGT T C A 1560 

Mil I I I I I I I I MIIMM III II I I II II M I I M M I II M I I II I 

AAGT AAGGAGGCT C GT GGGC AGAT GAT GGAGAC GC C C CT GGAC CAC AT CT GGGT G GT T C C 1560 
CTT T GGT CAC A — AGT AGGAGAC AC AGAT GGC AC CT GT GGC CAGAGC AC CT CAGGAC CCT 1618 

I I I II I I I I I I III MM I Mill I I II II II II I I II II II II M II M I I 

CTTTGGTCACATGAGTTGGAGCTATGGATGGTACCTGTGGCCAGAGCACCTCAGGACCCT 1620 

CCCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCC 1678 
I I I I I I I I I I II I I II I II I III II II I I I M I I I II I I I 


Db 


1621 CACCAAC CT G C CAAT GCT T CT GG C GT GACAGA- ACAGAGAAAT C AGGCAAGCT GGAT T AC 167 9 


Qy 1679 AGGGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CT GCT GGC GGGAAT A 1738 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I > I 

Db 168 0 AGGGCTTGCACCTGTAGGACACAGGAGAGGGAAGGAAGCAGCGTTCTGGTGGCAGGAATA 17 39 

Qy 1739 CTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAAC 17 98 

Ml | | | | | | | | | II III MUM I I I I I I I I I I M I I II I I I I I I I I I 

Db 1740 TCCTTAGGCACCACAAACTTGAGT-TGGAAATTTTGCTGCTTGAAGCTTCAGCCCTGACC 17 98 

Qy 17 99 CTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAG 1858 

I | | | I I I I I II I II I I I I I I I I M I I I I I I I I I I I I I I I > I I I I I 

Db 1799 CTCTGCCCAGCA-TCCTTTAGAGTCTCCAACCTAAAGTATTCTTTATGTC CTTCCAG 1854 

Qy 1859 AAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGA 1918 

I | || || M I I III II I M I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1855 AAGTACTGGCGTCATACTCAGGCTACC-CGGCATGTGTCCCTGTGGTACCCTGGCAGAGA 1913 

Qy 1919 AGAGAC CAAGCTT GT T T C C CT GCT GGC CAAAGT CAGT AG GAGAGGAT GCAC AGT T T GCTA 197 8 

I | | | | | | I I I I I I I I I I I I I I I I II I I I I II Ml I II I I I I I I I I . 

Db 1914 AAGGG C CAAT C - T CATT CCCTGCTGGC CAAAGT CAGC AGAAGAAGGT G — AAGT T T GCCA 1970 

Qy 1979 TT T GCT T T AGAGACAGGGACT GT AT AAACAAGC CT AAC AT T GGT GCAAAGAT T GC CT CTT 2038 

I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I Ml I I I I 

Db 1971 GTT GCT TTAGTGATAGGGACTGCAGACT CAAGCCT- ACACT GGTACAAAGACT GCGTCTT 2029 

Qy 2039 GAATTAAAAAAAAA 2052 

II I I II II II 

Db 2030 GAGATAAACAAGAA 2043 


RESULT 13 
US-09-548-372D-29 

; Sequence 29, Application US/09548372D 

; Patent No. 6420534 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/62801 

; CURRENT APPLICATION NUMBER: US/09/548, 372D 

; CURRENT FILING DATE: 2000-04-12 

; PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US99/2088 1 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 29 

LENGTH: 1362 
TYPE: DNA 

ORGANISM: Homo sapiens 


US-09-548-372D-29 


Query Match 65.7%; Score 1359; DB 4; Length 1362; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 135 9; Conservative 0; Mismatches 0; Inciels 0; Gaps 


Qy 

1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 

60 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 > 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 


Db 

1 

ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 

60 

Qy 

61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 

120 


I I M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 M 1 1 1 1 1 1 1 1 


Db 

61 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 

120 

Qy 

121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 

180 


I I I I I I I I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 


Db 

121 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 

180 

Qy 

181 

GT GGAGATGGT GGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACT ACGT GGAGAT GAC C 

240 


I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

181 

GT GGAGAT GGT GGACAACCT GAGGGGCAAGTCGGGGCAGGGCTACT ACGT GGAGAT GACC 

240 

Qy 

241 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTT^ACTTTGCA 

300 


I | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 


Db 

241 

GT G G G C AGCC C C C C GCAGAC GCT CAAC AT C CT GGT GGAT ACAGGCAGCAGT AACT T T GCA 

300 

Qy 

301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 

360 


I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 


Db 

301 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 

360 

Qy 

361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 

420 


I M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

361 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 

420 

Qy 

421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 

480 


I | | | | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

421 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 

480 

Qy 

481 

GCT GC C AT CACT GAAT C AGACAAGT T CT T CAT CAAC GGCT C CAACT G GGAAGGCATCCT G 

540 


1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 


Db 

481 

GC T GC C AT CACT GAAT C AGACAAGTT CT T CAT CAAC GGC T C CAACT GG GAAG GCAT CCT G 

540 

Qy 

541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 

600 


I | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

541 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 

600 

Qy 

601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 

660 


I | M | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 


Db 

601 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 

660 

Qy 

661 

C C C CT CAAC CAGT CT GAAGT G CT GG C CT CT GT C GGAGG GAGCAT GAT CAT T G GAGGTAT C 

720 


1 1 1 M II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

661 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 

720 

Qy 

721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 

780 


I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 

721 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 

780 


Qy 781 GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT G GAC AGGAT CT GAAAAT GGACT G CAAGGAG 840 

I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT G GAC AGGAT CT GAAAAT GGACT GCAAG GAG 840 

Qy 841 T ACAACT AT GACAAGAGCATT GT GGACAGT G GCAC CACCAAC CT T C GT TT GC CCAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACC7^CCTTCGTTTGCCCAAGAAA 900 

Qy 901 GT GT T T GAAGC T GC AGT CAAAT C CAT CAAGG CAGC CT CCT C C AC GGAGAAGT T C C CT GAT 960 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

I I I I I I I I I I I M I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

Qy 1081 AT CC TT C C GCAGCAAT AC CT G C GGC C AGT GGAAGAT GT GGC C AC GT C C CAAGAC GACT GT 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1081 AT CCTT C C G C AGCAAT AC CT GC G GC C AGT GGAAGAT GT GGC C AC GT C CCAAGAC GAC T GT 114 0 

Qy 1141 T ACAAGTTT GCCATCT CACAGT CAT CCAC GGGCACT GTT AT GGGAGCTGTTAT CATGGAG 12 00 

I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I 1 I I I I i I I I I I I II I I II I 
Db 1141 TACAAGTTT GCCATCT CACAGT CAT CCACGGGCACT GTT AT GGGAGCT GTTAT CATGGAG 1200 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

I I I I I I M I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I 

Db 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

Qy 1321 GAAGAC T GT G G C T AC AAC AT T C C AC AGAC AGAT G AGT C A 1359 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1321 GAAGACTGT GGCT ACAACAT T CCACAGACAGAT GAGT CA 1359 


RESULT 14 
US-09-548-367D-29 

; Sequence 29, Application US/09548367D 

; Patent No. 6440698 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL. 

TITLE OF INVENTION: ALZHEIMER 1 S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/6280H 

; CURRENT APPLICATION NUMBER: US/ 09/54 8 , 367D 
; CURRENT FILING DATE: 2000-04-12 

PRIOR APPLICATION NUMBER: US 60/155,4 93 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 09/404,133 


PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/208 81 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 29 
LENGTH: 1362 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-548-367D-29 

Query Match 65.7%; Score 1359; DB 4; Length 1362; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1359; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I 
Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

Qy 181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGTCGGGGCAGGGCT ACT ACGT GGAGATGACC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGTCGGGGCAGGGCT ACT ACGT GGAGATGACC 24 0 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCC7VACATT 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I! I I I I I 
Db 421 CTGGGCACCGACCTGGT7\AGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

Qy 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 G CT GC C AT CACT GAAT C AGACAAGT T C T T CAT CAAC G GCT C CAAC T GGGAAGGC AT C CT G 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 


Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 C C C CT CAAC CAGT CT GAAGT GCT GGCCTCTGT C GGAGGGAGC AT GAT CAT T G GAGGTAT C 720 

I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I ! I I I I I I I I i I I I II I I I I I I 
Db 661 C C C CT CAAC CAGT CT GAAGT GCT GGCCTCTGTC GGAGGGAGC AT GAT CAT T GGAGGT AT C 720 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 78 0 

I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 80 

Qy 781 GAGGT CATCATT GTGCGGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 GAGGT CAT C ATT GT GC GGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAG GAG 840 

Qy 841 TACAACT AT GACAAGAGCAT T GT GGAC AGT GGC AC CACCAAC CT T C GTT T GC CCAAGAAA 900 

II I I I I I I I I I I I I I I I I I I I I I M I I I I II I I I I I I I 1 I I I I I I I I I I I ! I I I I I I I I I 

Db 841 TACAACT AT GACAAGAGCAT TGT GGAC AGT GGCAC CACCAAC CT T C GT TT GC C C AAGAAA 900 

Qy 901 GT GTTT GAAG CT G C AGTCAAAT C CAT CAAG GC AGC CT CCT C CAC GGAGAAGT T C CCT GAT 960 

I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I 

Db 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

Qy 1021 T T C C CAGT CAT CT CACT CT ACCTAAT GGGT GAG GT T AC CAAC CAGT C CT T C C GC AT C ACC 108 0 

I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 1021 TTCCCAGTCATCTCACTCTACCT7VATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 108 0 

Qy 1081 AT C CT T C C G C AG CAAT AC CT GC G GCC AGT GGAAGAT GTGGCCAC GT C C CAAGAC GACT GT 114 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1081 AT C CT T C C G C AG CAAT AC CT GC G GCC AGT GGAAGAT GT GGC CAC GT C C CAAGAC GACT GT 114 0 

Qy 1141 T ACAAGT TT GC C AT CT CAC AGT CATC CAC GGGCACT GTT AT G GGAGCT GTTAT CAT GGAG 1200 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 T ACAAGT TT GC CAT CT CACAGT CATC CAC GGGCACT GTTAT G GGAGCT GTTAT CAT GGAG 12 00 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 1261 CAT GT GCAC GAT GAGT T C AGGAC GGC AG C GGT GGAAGGC C CTT T T GT CAC CT T GGACAT G 1320 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I 
Db 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

Qy 1321 GAAGACT GT GGCT ACAAC AT T C CACAGACAGAT GAGT C A 1359 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 GAAGAC T GT GGCT ACAAC AT T C CACAGACAGAT GAGT CA 1359 


RESULT 15 
US-09-551-853D-29 

; Sequence 29, Application US/09551853D 
; Patent No. 6500667 


GENERAL INFORMATION: 
APPLICANT : GURNEY ET AL. 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 
FILE REFERENCE: 29915/6280L 

CURRENT APPLICATION NUMBER: US/09/551, 853D 
CURRENT FILING DATE: 2000-04-18 
PRIOR APPLICATION NUMBER: US 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 2 9 
LENGTH: 1362 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-551-853D-29 

Query Match 65.7%; Score 1359; DB 4; Length 1362; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1359; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I II M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

Qy 181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGT CGGGGCAGGGCTACTAC GT GGAGAT GACC 24 0 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGT CGGGGCAGGGCTACTACGT GGAGAT GACC 24 0 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I E I I I I I I I I I 

Db 3 61 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 


Qy 


421 


CT G GGCAC C GACCT GGT AAGC AT C C C C C AT GG C C CCAAC GT CACT GT G C GT GC CAAC AT T 


480 


421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

481 GCT G C CAT CACT GAAT CAGACAAGTT CT T CAT CAACGGC T C CAACT G GGAAG GCAT C CT G 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 GCT GC CAT CACT GAAT CAGACAAGTT CT T CAT CAACGGC T C CAACT G GGAAG G CAT C CT G 54 0 

541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I I I I I I I I I II I I 

661 C C C CT CAACC AGT CT GAAGT GCTGGCCTCT GT C GGAGGGAGCAT GAT CAT T G GAGGT AT C 72 0 

721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 8 0 

I I I I I II i I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I 

721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 78 0 

781 GAGGTCAT CATT GT GCGGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
781 GAGGT CAT CATT GT G C GGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

841 TACAACT AT GACAAGAGC AT T GT G GACAGT GG CAC C AC CAAC CT T CGTT T GC C CAAGAAA 900 

I I I- I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 
841 TACAACT AT GACAAGAGCAT T GT G GACAGT GGCAC CAC C AAC CT T C GTT T GC C CAAGAAA 900 

901 GT GT T T GAAG CT GCAGT CAAAT C CAT CAAG GCAGCCT C C T C CAC GGAGAAGT T C CCT GAT 960 

I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
901 GT GT T T GAAGCT GCAGT CAAAT C CAT CAAGGC AGCCT C C T C CAC G GAGAAGT T C C CT GAT 960 

961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 

961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 10 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
1021 T T C C CAGT CAT CT CACT CT AC CTAAT GGGT GAGGTT AC CAAC CAGTC CT T C C GCAT CAC C 1080 

1081 AT C CT T CC GC AGCAAT AC CT G C GGC C AGT GGAAGAT GT GGC CAC GT C C CAAGAC GACT GT 114 0 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1081 AT CCT T CCG C AGCAAT AC CT GC GGC CAGT GGAAGATGT GGC CAC GT CC CAAGAC GACT GT 1140 

1141 T ACAAGTT T GC CAT C T CAC AGT CAT C C AC GG GC ACT GT TAT GGGAGCT GT TAT CATGGAG 12 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I i I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1141 T ACAAGTT T GC C AT CT CAC AGT CAT CC AC GGG CACT GT TAT GGGAGCT GT TAT CATG GAG 12 00 

1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

1261 CAT GT GCAC GAT GAGT TC AG GAC GG C AGC GGT GGAAGGC C CT T T T GT C AC CT T GGACAT G 132 0 
I I I I I I II I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I 


Db 1261 CAT GT GCAC GAT GAGTT C AG GAC G GCAGC GGT G GAAG GC C CTT TTGT C AC CT T GGAC AT G 1320 


Qy 1321 GAAGACT GT GGCTACAACATTCCACAGACAGAT GAGT CA 1359 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 GAAGACT GT GGCTACAACATTCCACAGACAGAT GAGT CA 1359 
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ALIGNMENTS 


RESULT 1 
US-09-794-927-3 

; Sequence 3, Application US/09794927 

; Patent No. US20010016324A1 

; GENERAL INFORMATION: 

; APPLICANT: Gurney, Mark E. 


AND 


APPLICANT: Bienkowski, Michael J. 
APPLICANT: Heinrikson, Robert L. 
APPLICANT: Parodi, Luis A. 
APPLICANT: Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 

TITLE OF INVENTION: USES 

TITLE OF INVENTION: THEREFOR 

FILE REFERENCE: 28341/6280FG 

CURRENT APPLICATION NUMBER: US/ 09/7 94, 927 

CURRENT FILING DATE: 2001-02-27 


PRIOR APPLICATION NUMBER 
PRIOR FILING DATE: 1999- 
PRIOR APPLICATION NUMBER 
PRIOR FILING DATE: 1999- 
PRIOR APPLICATION NUMBER 
PRIOR FILING DATE: 1999- 
PRIOR APPLICATION NUMBER 
PRIOR FILING DATE: 1999- 
PRIOR APPLICATION NUMBER 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: PatentlnVer. 2.0 
SEQ ID NO 3 
LENGTH: 2 07 0 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-794-927-3 


09/416, 901 
10-13 

60/155, 493 
09-23 

09/404, 133 
09-23 

PCT/US99/20881 
09-23 

60/101,594 


Query Match 100.0%; Score 2070; DB 9; 

Best Local Similarity 100.0%; Pred. No. 0; 
Matches 207 0; Conservative 0; Mismatches 


Length 2 070; 
0; Indels 0; 


Gaps 


0; 


QY 
Db 


1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I 
1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 


Qy 

Db 


61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I 

61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 


Qy 


Db 


121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 


Qy 

Db 

Qy 

Db 


181 GT GGAGAT GGTGGACAACCT GAGGGGCAAGT CGGGGCAGGGCTACTACGT GGAGAT GACC 24 0 

I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

181 GT GGAGAT GGTGGACAACCT GAGGGGCAAGT CGGGGCAGGGCTACTACGT GGAGAT GAC C 24 0 

241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTT^ACTTTGCA 3 00 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I 

241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 


Qy 

Db 


301 
301 


360 


360 


Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

Qy 4 81 GCT GC CAT C AC T GAAT C AGACAAGT TC T T CAT CAAC GGCT C CAACT GGGAAGG CAT C CT G 54 0 

I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 4 81 G CT GC CAT CACT GAAT C AGACAAGT T CT T C AT CAAC GGCT C CAACT GG GAAGGCAT C CT G 540 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | M I I I I I I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I I I I I I I I I I I I 1 
Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 661 CC CCT CAACCAGT CTGAAGTGCT GGCCT CT GT CGGAGGGAGCATGAT CATT GGAGGTAT C 72 0 

Qy 721 GACCACT C GCT GTACACAGGCAGT CTCT GGTATACACCCAT CCGGC GGGAGT GGT ATTAT 7 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 GAC CACT C GCT GTACACAGGCAGT CTCT GGTAT AC ACC CAT C C GG C GGGAGT G GT ATTAT 7 80 

Qy 781 GAGGT CAT CATTGTGC GGGTGGAGATCAAT GGACAGGATCT GAAAAT GGACTGCAAGGAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I M I I I I I I I I I I I I I I I I 
Db 781 GAGGT CAT CAT T GT GC GG GT GGAGAT CAAT GGAC AGGAT CT GAAAAT GGAC T GCAAGGAG 840 

Qy 841 T ACAACT ATGACAAGAG CATT GT GGAC AGT GGCAC CAC CAAC CT T C GTT T GC C CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I 

Db 841 TACAACTATGACAAGAGCATT GT GGACAGT GGCAC CACCAACCTT CGTTT GCCCAAGAAA 900 

Qy 901 GT GT T T GAAGC T GCAGT CAAAT C CAT CAAGGCAG C CT C CT C CAC GGAGAAGTT C CCT GAT 960 

I II I I I I I I I I I I II II II I I I I I I I I II I I M II I I I I I I I I I I I I I I I M I I I I I I I I 

Db 901 GT GT T T GAAGC T G CAGT CAAAT C CAT CAAGGCAGC CTC CT C CAC GGAGAAGT T C C CT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I 
Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 T T CC CAGT CAT CT C ACT CT AC CT AAT G G GT GAG GT T AC CAACCAGT C C TT C C GC AT CACC 1080 

I I I II I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 TT CC CAGT CAT CT CAC T CT AC C T AAT G G GT GAGGTT AC CAAC CAGT C C T T C C GCAT CAC C 1080 

Qy 1081 AT CC T T C C GCAGCAAT AC CT GC GGCC AGT GGAAGAT GT GGC C AC GT C C CAAGAC GACT GT 1140 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1081 AT CCTTCCGCAGCAATACCTGCGGCCAGT GGAAGAT GTGGCCACGTCCCAAGACGACTGT 114 0 

Qy 1141 T ACAAGT TT G C C ATC T C ACAGT CATC CAC GGGCAC T GT TAT GGGAGCT GTT AT CAT GGAG 1200 

I I I I I I I I I I I I I I I I M I I M ! I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | 
Db 1141 T ACAAGT TT G C C ATCT C ACAGT CAT C C AC GGGCACT GT TAT GGGAGCT GT TAT CAT GGAG 1200 


Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 


1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

12 61 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 61 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 132 0 

1321 GAAGAC T GT GGCTAC AAC AT T C C ACAGAC AGAT GAGT CAAC CCT CAT GAC CAT AG CC TAT 13 8 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1321 GAAGACTGT GGCTACAACATT CCACAGACAGATGAGTCAACCCT CATGACCATAGCCTAT 1380 

1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 4 0 

1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

1501 AAGT GAGGAGGC C C AT GGGC AGAAGAT AGAGAT T C C CCT GGAC C AC AC CT C C GT GGT T CA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

1501 AAGT GAGGAG GC C CAT GGGCAGAAGAT AGAGAT T C CC CT GGAC C AC AC CT C C GT GGTT C A 1560 

1561 CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCC 1620 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1561 CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCC 1620 

1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I. I I I I I I I I I I I I I I I II I I I I I II I I I I I I 
1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

1681 GGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGC ACT CT GCT GG C GGGAAT ACT 174 0 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1681 GGACT GT AC CTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACT CTGCT GGC GGGAAT ACT 1740 

1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

II I I I I I I I I I I I I I I II I I I I- I I I I I I II I I I I I I I I I I II I I I I I I I II I I I I I I I I I 
1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I 
1801 TTGTCCACCATTCCTTTA7\ATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

18 61 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II I I I I I I I I I II 
1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

1921 AGACCAAG CT T GTTTCCCTGCT G GC CAAAGT C AGT AGGAGAGGAT G CACAGT T T GC TAT T 19 80 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1921 AGAC CAAGC T T GT T T CC CT GCT G GC CAAAGT C AGTAGGAGAGGATGCACAGT T T GCT AT T 1980 

1981 T GCTTTAGAGACAGGGACT GT ATAAACAAGCCTAACATT GGT GCAAAGATT GC CTCTT GA 2040 

M I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1981 TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTTGA 204 0 

2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 
I II I I I I I I I I I I I I I I I I II I I I I I I I I I 


Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 2 
US-09-795-847-3 

Sequence 3, Application US/09795847 
Patent No. US20010018208A1 
GENERAL INFORMATION: 


APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 


Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 


AND 


TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 

TITLE OF INVENTION: USES 

TITLE OF INVENTION: THEREFOR 

FILE REFERENCE: 28341/6280DE 

CURRENT APPLICATION NUMBER: US/09/795, 847 

CURRENT FILING DATE: 2001-02-28 


PRIOR APPLICATION NUMBER 


PRIOR FILING DATE: 


1999 


PRIOR APPLICATION NUMBER 


PRIOR FILING DATE: 


1999 


PRIOR APPLICATION NUMBER 


PRIOR FILING DATE: 


1999 


PRIOR APPLICATION NUMBER 

1999 


PRIOR FILING DATE 
PRIOR APPLICATION NUMBER 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: PatentlnVer. 2.0 
SEQ ID NO 3 
LENGTH: 2 070 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-795-847-3 


09/416, 901 
10-13 

60/155, 493 
09-23 

09/404, 133 
09-23 

PCT/US99/20881 
09-23 

60/101,594 


Query Match 100.0%; Score 2070; DB 9; 

Best Local Similarity 100.0%; Pred. No. 0; 
Matches 207 0; Conservative 0; Mismatches 0; 


Length 2070; 
Indels 0; 


Gaps 


0; 


Qy 

Db 


1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 


Qy 

Db 


61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 


Qy 


Db 


121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 


Qy 


Db 


181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGTCGGGGCAGGGCTACTACGT GGAGATGACC 24 0 

I I I I I M I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 GT GGAGAT GGT GGACAACCT GAGGGGCAAGTCGGGGCAGGGCTACTACGT GGAGATGACC 24 0 


241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I II I I I I I II 
241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I M I I I I I I I I I I I I I I I I I I I I M I I i I I I I I I I I M I I | | | | | || | | | | |.| | | | | | | 
421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

481 G CT G CC AT CACT GAAT C AGACAAGT T CT T CAT CAAC GGCT CCAACT G GGAAGG C AT C CT G 540 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | 
4 81 GCT GCCAT CACT GAAT CAGACAAGTTCTT CAT CAAC GGCT CCAACT GGGAAGGCAT CCT G 54 0 

541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I | | | | | 
541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

661 C C C CT CAAC C AGT C T GAAGT GCTGGCCT CT GT CGGAGGGAGCAT GAT CAT T GGAGGT AT C 72 0 

I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I | M I I I I I I I I I I I I I I 

661 C C C CTCAAC CAGT CT GAAGT GCTGGCCTCT GT CGGAG GGAGCAT GAT CAT T GGAGGT AT C 72 0 

721 GAC CACT C G CT GT ACACAGG C AGT CT C T GGT AT ACAC C CAT C C GGC GG GAGT G GT AT TAT 78 0 

I I I I I I I I I I M I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | 
721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 78 0 

781 GAGGT CAT CATT GT G C GGGT G GAGAT CAAT GGACAGGAT CT GAAAAT G GACT GCAAGGAG 84 0 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
781 GAGGT CAT C ATT GT GC GGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

841 T ACAAC TAT GACAAGAGCAT T GT GGAC AGT GGC AC CAC CAAC C T T C GT T T GC C CAAGAAA 900 

I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | || | | | | | | | | | 
841 TAC7^ACTATGACAAGAGCATTGTGGACAGTGGCACCACC7\ACCTTCGTTTGCCCAAGAAA 900 

901 GT GTT T GAAG CT GCAGT CAAAT C CAT CAAGGCAG C CT C C T C CAC GGAGAAGT T C C CT GAT 960 

I I I I I I I I I I II I M I i I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | 
901 GT GTTT GAAGCT G CAGT CAAAT C CAT CAAGGCAGC CT C CT C CAC G GAGAAGT T CC CT GAT 960 

961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 108 0 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 108 0 


Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 114 0 

I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I I I I I I I I 
Db 1081 AT C CT T C C GCAGCAAT AC CT GC GG C CAGT G GAAGAT GT GGC C AC GT C C CAAGAC GACTGT 114 0 

Qy 1141 TACAAGTTTGC CATCT CACAGTCAT CCACGGGCACTGTTAT GGGAGCTGTTAT CATGGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1141 T ACAAGT T T GC CAT C T CAC AGT CAT C C ACG GG CACT GTT AT GGGAGCT GTTAT CAT GGAG 1200 

Qy 12 01 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 12 61 CAT GT GC ACGAT GAGT T CAGGAC G GCAGC G GT G GAAG GC C C T T T T GT CAC CTT GGAC AT G 132 0 

I I I I I I I M I I I I I I I I I I I I II I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 1261 CAT GT G CAC GAT GAGT T CAGGAC GG CAGC GGT GGAAGGC C CT T T T GT CAC CT T GGAC AT G 1320 

Qy 1321 GAAGACT GTGGCT ACAACATT CCACAGACAGATGAGTCAACCCT CATGAC CATAGCCTAT 1380 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 GAAGACT GT G GCT ACAACAT T C C ACAGAC AGAT GAGT CAAC C CT C AT GAC CATAGC CT AT 1380 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I I M I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGT GAGGAGGC C C AT GGGCAGAAGAT AGAGAT T C C CCT G GAC C ACAC CT C C GT GGTT C A 1560 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1501 AAGT GAGGAG GC C CAT GGG C AGAAGAT AGAGATT C CC C T GGAC C ACAC CT C C GT GGT T C A 1560 

Qy 1561 CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCC 1620 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I 
Db 1561 CTTT GGTCACAAGT AGGAGACACAGATGGCACCT GTGGC CAGAGCACCT CAGGACCCT CC 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 

Qy 1681 GGAC T GT AC CT GT AG GAAACAGAAAAGAGAAGAAAGAAGCAC T CT GCT GGC GG GAAT AC T 1740 

I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 1681 G GACT GT AC CT GT AG GAAACAGAAAAGAGAAGAAAGAAGC ACT C T GCT GGC GGGAAT ACT 174 0 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M 
Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGAC CAAGC T T GT T T C C CT GC T GGCCAAAGT CAGT AGGAGAGGAT GCAC AGT T TGC T AT T 1980 


Db 


I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1921 AGAC CAAG CT T GT TTCCCTGCTGGC CAAAGT CAGT AGGAGAGGAT GC AC AGTT T GCT ATT 198 0 


Qy 1981 T GCT T T AGAGAC AG GGACT GTATAAACAAG C CT AAC ATT GGT GCAAAGAT T GC CT CT T GA 204 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 1981 T GCT T T AGAGAC AGGGACT GT ATAAACAAGC CTAAC AT T GGTGCAAAGATT GC CT CTT GA 204 0 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 3 
US-09-794-743-3 

Sequence 3, Application US/09794743 
Patent No. US2001002 1391A1 
GENERAL INFORMATION: 


APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 


Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 


AND 


TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 

D 

TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280BC 
CURRENT APPLICATION NUMBER: US/09/7 94,74 3 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER 
PRIOR FILING DATE: 1999 
PRIOR APPLICATION NUMBER 
PRIOR FILING DATE: 1999 


PRIOR APPLICATION NUMBER 


PRIOR FILING DATE: 


1999 


PRIOR APPLICATION NUMBER 


PRIOR FILING DATE: 


1999 


PRIOR APPLICATION NUMBER 


09/416, 901 
10-13 

60/155,493 
09-23 

09/404, 133 
09-23 

PCT/US99/20881 
09-23 

60/101,594 


PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 7 3 
SOFTWARE: PatentlnVer. 2.0 
SEQ ID NO 3 
LENGTH: 2 070 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-794-743-3 


Query Match 100.0%; Score 2070; DB 9; 

Best Local Similarity 100.0%; Pred. No. 0; 
Matches 2 070; Conservative 0; Mismatches 0; 


Length 2 070; 
Indels 0; 


Gaps 


0; 


Qy 

Db 


1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 


Qy 


61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 
I I f I I I I I II I I I I I I I I I I I I I I | I | I | I | | | | | | | | I I I I I I I I I I I | | | | | | | | | | | 


120 


Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I II I I I I I I II I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I! I I I I I I I I I I I I I | | | | M I I I I I I I I I I 

Db 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 240 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 241 GTGGGCAGCCCCCCGCAGACGCTCT^ACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I | I I I I I II 
Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I II I I I I I I I II I I I I I I I I I I I II I I M I I I I I I I I I I I I I I II M I I I I I I I I I I 
Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

Qy 421 CTGGGCACCGACCTGGT7VAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

Qy 4 81 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I II 
Db 4 81 GCT GCCAT CACTGAAT CAGACAAGTT CTT CAT CAACGGCTCCAACT GGGAAGGCATCCT G 540 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I II I I I I I I II I I I I I I I 
Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 661 C C C CT CAAC CAGT C T GAAGT GCTGGCCTCT GT CGGAGGGAGC AT GAT CAT T G GAGGT AT C 720 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 78 0 

Qy 781 GAG GT CAT CAT T GT GC GG GT GGAGATCAAT G GAC AGGAT CT GAAAAT GGAC T GCAAGGAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 GAGGT CAT CAT T GT GC GGGT GGAGATCAAT GGAC AGGAT CT GAAAAT GGAC T GCAAGGAG 84 0 

Qy 841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 841 TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACC7\ACCTTCGTTTGCCCAAGAAA 900 

Qy 901 GT GT TT GAAGCTG C AGT CAAAT C CAT CAAGGCAG C CT C CT C C AC GGAGAAGT T C CCT GAT 960 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I II I I I I I II I I II II I I I I I I I I I 
Db 901 GT GTT T GAAG CT GC AGT CAAAT C CAT CAAG GC AGC CT C CT C C AC GGAGAAGT T C CCT GAT 960 


Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 108 0 

Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 114 0 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 114 0 

Qy 1141 T ACAAGT T TGC CAT CT C AC AGT CAT C CAC GGGCAC T GT T AT GGGAGC T GT TAT CAT GGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 T ACAAGT TT G C CAT CT CAC AGT C AT CC AC GGGCACT GT TAT GGGAGCT GT TAT CAT GGAG 1200 

Qy 12 01 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 1261 CAT GT GC ACGAT GAGT T CAG GAC G GC AG C GGT G GAAGGC C CTT TT GT CAC C T T GGAC AT G 132 0 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1261 CAT GT GC ACGAT GAGT T CAGGAC GGC AG C GGT GGAAGGC C CT T T T GT CAC CTT GGAC AT G 1320 

Qy 1321 GAAGACT GTGGCTACAACATT CCACAGACAGAT GAGT CAACCCT CAT GACCATAGCCT AT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I 
Db 1321 GAAGACT GTGGCTACAACATT CCACAGACAGAT GAGTCAACC CT CAT GACCATAGCCTAT 138 0 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I I I I I I 
Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGT GAGGAGGC C CAT GGGC AGAAGAT AGAGAT T C C CCT GGAC CAC ACCT C C GT GGT T C A 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1501 AAGT GAGGAG GC C CAT GG GC AGAAGAT AGAGAT T C C CCT GGAC CAC ACCT C C GT GGTT C A 1560 

Qy 1561 CT TT GGT C ACAAGT AGGAGAC AC AGAT GGC AC CT GT GGC C AGAGC AC CT CAGGAC C CT C C 162 0 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1561 CT TT GGT C ACAAGT AGGAGACAC AGAT G G CAC C T GT GG C CAGAGC AC CT CAGGAC C CT C C 1620 

Qy 1621 C CAC C CAC CAAAT GC CT CT GC CTT GAT G GAGAAGGAAAAGGCT GGCAAG GT GGGT T C CAG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

Qy 1681 GGACT GT AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAG C ACT CT GCTGGCGG GAAT ACT 1740 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I IE I I I I I I I I 
Db 1681 GGACT GT ACC T GT AG GAAAC AGAAAAGAGAAGAAAGAAGC ACT CT GCT GGC GGGAAT ACT 174 0 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1741 CTT GGT CAC CT CAAAT TTAAGTCGGGAAATTCT GCT GCT TGAAACTTCAGCCCTGAACCT 1800 


Qy 

Db 


1801 
1801 


1860 
1860 


Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGAC CAAGCT T GTTTCCCTGCT GGC CAAAGT C AGT AGGAGAGGAT GCAC AGT T T GCT ATT 198 0 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1921 AGAC CAAGCT T GT T T C C CT GCT GGC CAAAGT C AGT AGGAGAGGAT GCAC AGT T T GCT ATT 198 0 

Qy 1981 TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTTGA 2 04 0 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 1981 T GCTT T AGAGAC AGGGACT GT AT AAACAAGC CTAACAT T GGT GCAAAGAT T GC CT CTT GA 2 04 0 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2 07 0 
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Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I II I I I I I I I I I I I I I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M 
Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

Qy 181 GT GGAGAT GGTGGACAACCT GAGGGGCAAGTC GGGGCAGGGCTACTACGT GGAGATGACC 24 0 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 181 GT GGAGAT GGT G GACAAC CT GAGGGGCAAGT C GGG GC AGG GCT ACTAC GT GGAGAT GAC C 24 0 

Qy 241 GT GGGCAGCCCC CC GCAGACGCT CAACAT C CT GGT GGAT ACAGGCAGCAGT AACTTT GCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I 
Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

Qy 481 GCTGCCAT CACT GAAT CAGACAAGTT CTTCAT CAACGGCT CCAACT GGGAAGGCAT C CTG 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I! I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 C C C CT CAAC CAGT CT GAAGT GCTGGCCTCTGTCG GAGGGAGC AT GAT CAT T G GAG GT AT C 720 

II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I 

Db 661 CCCCTCAACCAGTCT GAAGT GCTGGCCTCTGTCGGAGGGAGCAT GAT CATTGGAGGT AT C 72 0 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

Qy 781 GAGGT CAT C ATT GT GC GG GT GGAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 


Db 781 GAGGT CAT C ATT GT GC GGGT GGAGAT CAAT GGAC AGGATCT GAAAAT GGACT G CAAGGAG 840 

Qy 841 T ACAACT AT GACAAGAGCAT T GT G GAC AGT GGC AC C AC CAAC CT T C GT T T GC C CAAGAAA 900 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 T ACAACT AT GACAAGAGCAT T GT GGACAGT GGC AC CAC CAAC CTT C GT T T GC C CAAGAAA 900 

Qy 901 GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 901 GT GT T T GAAGCT GC AGT CAAAT C CAT CAAG GCAGC CT C CT C CAC GGAGAAGT T C C CT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 T TCC C AGT CAT CT CACT CT AC CT AAT GGGT GAG GT T AC CAAC CAGT C CT T C C GC AT CAC C 108 0 

Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 1081 AT C CTT C C GC AGCAAT AC CT GC GGC CAGT G GAAGAT GT GGC CAC GT C C CAAGAC GACT GT 1140 

Qy 1141 TACAAGTTT GCCATCTCACAGTCAT CCAC GGGCACTGTT AT GGGAGCT GTT AT CAT GGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I 
Db 1141 TACAAGTTT GCCATCTCACAGT CAT CCAC GGGCACTGTT AT GGGAGCT GTT AT CAT GGAG 1200 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I II I I I I I I I II I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 1261 CAT GT GC AC GAT GAGT T CAGGAC GGCAGC G GT GGAAGG CC CT T T T GT CACCT T GGAC AT G 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1261 CAT GT GC AC GAT GAGT T CAGGAC G GCAGC GGT GGAAGGCC CT T T T GT CACCT T GGAC AT G 1320 

Qy 1321 GAAGACT GT GGCTACAACATT CCACAGACAGAT GAGT CAAC CCT CATGACCATAGCCTAT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I 
Db 1321 GAAGACT GT GGCTACAACATT CCACAGACAGAT GAGT CAAC CCT CAT GACCATAGCCT AT 1380 

Qy 1381 GT CAT GGCT GCCAT CT GC GC C CT CTT CAT GCTGC CACT CTG CCT CAT GGT GTGT CAGT GG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1381 GT CAT GGCT GCCAT CTGCGCCCT CTT CAT GCTGCCACTCTGCCT CAT GGT GTGT CAGT GG 14 40 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGT GAGGAGGCC CAT GGGC AGAAGAT AGAGAT TCC C CTGGAC C AC AC CT CC GT GGT T C A 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1501 AAGT GAGGAGGC C CAT GGGC AGAAGAT AGAGAT TCC C CT GGAC C AC AC CT CC GT GGT T CA 1560 

Qy 1561 C TTT GGT C ACAAGT AGGAGAC ACAGAT GG C AC CT GT GGC CAGAGCAC CT CAGGAC C CT C C 1620 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I 

Db 1561 CTTT GGT C AC AAGTAGGAGAC ACAGAT GGC AC CT GTGGCCAGAGCAC CT CAGGACC CT CC 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 


Qy 1681 
Db 1681 


1740 


1740 


Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

Qy 1801 T T GT C C AC CAT T C CT T TAAATT C T C CAAC C CAAAGTATT CT T CT T TT CTT AGT TT C AGAA 18 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. I I I I I I I I I I I I 
Db 1801 T T GT C C AC CAT T C CT T TAAATT CT C CAAC C CAAAGTATT CT T CT T TT CTT AGT TT CAGAA 18 60 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 192 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 198 0 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1921 AGACCAAGCT T GTTT CC CT GCT G GC CAAAGT C AGT AGGAGAG GAT GCAC AGT T T GC T ATT 198 0 

Qy 1981 T GCT T T AGAGAC AGGGACT GT ATAAAC AAGCC TAACATT GGT GCAAAGAT T GC CT CT T GA 204 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1981 T GCT TT AGAGAC AGGGACT GT AT AAACAAGCC TAACATT GGT GCAAAGAT TGC CTCTT GA 204 0 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 5 
US-09-794-925-3 

Sequence 3, Application US/09794925 
Patent No. US20020064819A1 
GENERAL INFORMATION : 
APPLICANT: Gurney, Mark E. 
APPLICANT: Bienkowski, Michael J. 
APPLICANT: Heinrikson, Robert L. 
APPLICANT: Parodi, Luis A. 
APPLICANT: Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND USES 

TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280HI 
CURRENT APPLICATION NUMBER: US/09/794 , 925 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 


SOFTWARE: Patentln Ver, 2.0 
SEQ ID NO 3 
LENGTH: 207 0 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-794-925-3 

Query Match 100.0%; Score 2070; DB 9; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I II 
Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 GT GGAGAT GGT GGACAAC CT GAG GGGCAAGT C GGGG C AGGGCT AC T AC GT G GAGAT GACC 24 0 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 3 61 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I II I I I I I I II I I I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 C T GGGC AC C GAC CT GGT AAGCAT C CC C C AT GGC C C CAAC GT CACT GT GC GT GC CAAC ATT 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I .1 I I I I I I I I I I I I I I I I I I I I I II M I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

Qy 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I 
Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 


Qy 


661 C C CCT CAAC C AGT CT GAAGT GCT GGCCTCTGT C GGAGGGAG CAT GAT C ATT GGAGGT AT C 72 0 
I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 


Db 


661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 720 


Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

Qy 7 81 GAGGT CATCATT GTGCGGGTGGAGATCAAT GGACAGGATCT GAAAAT GGACTGCAAGGAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT GGACAGG AT CT GAAAAT GGACTGCAAGGAG 840 

Qy 841 T ACAACT AT GACAAGAGCATT GT G GACAGT GGCAC CAC CAAC CT T C GT T T GCC CAAGAAA 900 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 841 T ACAACT AT GACAAGAGCATT GT G GACAGT GGCAC CAC CAAC CT T C GT T T GCC CAAGAAA 900 

Qy 901 GT GTT T GAAGCT GC AGT CAAAT C CAT CAAG GCAGC CT C CT C CAC GGAGAAGTT C C CT GAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 901 GT GTT T GAAGCT GC AGT CAAAT C CAT CAAG GCAGC CT C CT C CAC GGAGAAGTT C C CT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 T T C CCAGT CAT CT C ACT CT AC CT AAT GGGT GAGGT T AC CAAC C AGT C CT T C CGCAT CAC C 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

Qy 1081 AT C CTT C C GCAGCAAT ACCT GC GGC C AGT GGAAGAT GT GGC CAC GT C C CAAGAC GACT GT 1140 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1081 AT C CTT C C GCAGCAAT AC CT GC GGC C AGT GGAAGAT GT GGC CAC GT C C CAAGAC GACT GT 1140 

Qy 1141 T ACAAGT T T G C CAT CT CAC AGT CAT C CAC GG G C ACT GTT AT GGGAGCT GT TAT CAT GGAG 1200 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 T ACAAGT T T G C CAT CT CAC AGT CAT C CAC GGG C ACT GT TAT GGGAGCT GT TAT CAT GGAG 1200 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I II I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 12 61 CAT GT GCACGAT GAGTT CAGGAC G GCAGC GGT GGAAG GCCCT TT T GT CAC CTT GGAC AT G 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 61 CAT GT GCAC GAT GAGTT CAGGAC G GCAGC G GT GGAAGGCCCT TT T GT CAC CTT GGAC AT G 1320 

Qy 1321 GAAGACT GT GGCT ACAACATT C CACAGACAGAT GAGT CAAC C C T CAT GAC CAT AGC CTAT 1380 

I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1321 GAAGACT GT G GCT ACAACAT T C CACAGACAGAT GAGT CAAC C CT CAT GAC CAT AGC CTAT 1380 

Qy 1381 GT CAT GGCT GCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCAT GGT GTGTCAGTGG 1440 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II 
Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

Qy 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGT GAGGAGGC C CAT GG G CAGAAGAT AGAGAT T C C C CT GGAC C AC AC CT C CGT GGTT C A 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1501 AAGT GAG GAGGC C CAT GGGC AGAAGAT AGAGATT C C C CT GGAC C AC AC CT C CGT GGTT C A 1560 


Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 


1561 CTTTGGT CACAAGTAGGAGACACAGATGGCACCT GT GGCCAGAGCACCT CAGGACCCT CC 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1561 CTTTGGT CACAAGTAGGAGACACAGATGGCACCT GT GGCCAGAGCACCT CAGGACCCT CC 1620 

1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

1681 GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1681 GGACT GTACCTGT AGGAAACAGAAAAGAGAAGAAAGAAGCACTCT GCTGGCGGGAATACT 1740 

1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I 

1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
18 61 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

1981 T GCT T T AGAGAC AG GGACT GT AT AAACAAGC CT AAC AT T GGT GCAAAGATTGC CT CT T GA 2040 

I I I I I I I I I I II I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1981 T GCT T TAGAGAC AG GGACT GT AT AAACAAGC CT AAC AT T GGT GCAAAGATTGC CT CT T GA 2040 

2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 


RESULT 6 
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Matches 2070; Conservative 0; Mismatches 0; Inciels 0; Gaps 0; 


Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 


Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 


Qy 121 CT G C G GCT GC C C CGGGAGACC GAC GAAGAG C CC GAGGAGC C C GG C C GGAGGGGC AGCT T T 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 


Qy 181 GT GGAGAT GGT GGACAACCTGAGGGGCAAGT CGGGGCAGGGCT ACT ACGT GGAGAT GACC 24 0 

I I I I I I ! I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I 
Db 181 GT GGAGAT GGT GGACAACCTGAGGGGCAAGT CGGGGCAGGGCT ACT ACGT GGAGAT GAC C 240 


Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 


Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 


Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I I I I I I I I I I I IE I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 


Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 4 80 


Qy 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 


Qy 


541 


GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 


600 


541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

661 CCCCT CAACCAGT CT GAAGT GCT GGCCT CTGTCGGAGGGAGCAT GATCATTGGAGGTATC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I 
721 GACCACT CGCT GTACACAGGCAGT CTCT GGTATACAC CCAT CCGGC GGGAGTGGTATTAT 7 80 

781 GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT GGAC AG GAT C T GAAAAT GGACT GCAAG GAG 840 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II 
781 GAG GT CAT CAT T GT G C G GGT GGAGAT CAAT GGACAGGAT C T GAAAAT G GACT GCAAGGAG 840 

841 T ACAACT AT GACAAGAGCAT T GT G GACAGT GGCAC C AC CAACCTT C GT T T GC C CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

841 T ACAACT AT GACAAGAGCAT T GT GGAC AGT GGCAC C AC CAACC TT C GT TTG CC CAAGAAA 900 

901 GT GT TT GAAGCT GCAGT CAAAT C C ATCAAGG C AGC CT C CT C C AC GGAGAAGTT C C C T GAT 9 60 

M I I I I I I I I I M I I I I I I I I II I I I I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I 

901 GT GT T T GAAGCT GCAGT CAAAT C CAT CAAGG C AGC CT C CT C C AC GGAGAAGT T C C C T GAT 9 60 

961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

1021 TT C C CAGT CAT CT C AC T C TAC C TAAT GGGT GAGGT T AC CAACCAGT C CT T C CGC AT C AC C 108 0 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
1021 T T C C CAGT CAT C T C ACT C TAC C TAAT G GGT GAGGT TAC CAACCAGT C CTT C C G CAT C AC C 108 0 

1081 AT C CTT C C GC AG CAAT AC CT GC GGCC AGT GGAAGAT GT GGC C AC GT C C CAAGAC GACTGT 114 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
1081 AT C CTT C C GC AG CAAT AC CT GC GGCC AGT GGAAGAT GT GGC C AC GT C C CAAGAC GACT GT 114 0 

1141 T ACAAGT TTGC CAT CT C ACAGT CAT C C AC GGGCACT GT TAT GGGAGCT GTT AT CAT GGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I 
1141 T ACAAGTTTGCCAT CT C ACAGT CATCCAC GGGCACT GTT AT GGGAGCT GTT AT CAT GGAG 1200 

1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I 
1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

12 61 CAT GT GC ACGAT GAGT T C AG GAC GGCAGC GGT GGAAGGC C CTT T T GT CACCT T G GAC AT G 132 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I I 
12 61 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 132 0 

1321 GAAGACT GT G G CT ACAAC ATT C C ACAGACAGAT GAGT CAAC CCT CAT GAC CAT AG C CT AT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
1321 GAAGACT GT G GCT ACAAC ATT C CACAGAC AGAT GAGT CAACCCT CAT GACCAT AGC CT AT 1380 

1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 


Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 15 00 

Qy 1501 AAGT GAGGAG GC C CAT GGGC AGAAGAT AGAGAT T CCC CT GGAC CACAC CT C CGT G GT T C A 1560 

I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I I I I I I I I II II I I I I I I 
Db 1501 AAGT GAGGAGGCC CAT GGGCAGAAGATAGAGATT CCCCT GGACCACACCT CCGTGGTT CA 1560 

Qy 1561 CTT T GGT C ACAAGT AGGAGACACAGAT GGCAC CT GT GGC C AGAGC AC CT C AGGAC C CT C C 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 1561 CTT T GGT CACAAGT AGGAGACACAGAT GGCAC CT GT GGC C AGAGC AC CT C AGGAC C CT C C 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1621 C CAC C C AC CAAAT GCCTCTGCCTT GAT G GAGAAG GAAAAG GCT GGC AAGGTGG GT T C C AG 168 0 

Qy 1681 GGACT GTAC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGC ACTCT GCT GG CGGGAAT ACT 1740 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 GGACT GTAC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAGC ACTC T GC T GGCGGGAAT ACT 1740 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTG7\AACTTCAGCCCTGAACCT 18 00 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

Qy 18 01 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAG7\A 1860 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I I I I I I I I I I I I I I II I II I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 198 0 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 198 0 

Qy 1981 TGCTTTAGAGACAGGGACT GTATAAACAAGC CTAACATT GGTGCAAAGATTGCCT CTT GA 2 040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1981 TGCTTTAGAGACAGGGACT GTATAAACAAGC CTAACATT GGTGCAAAGATTGCCT CTT GA 2040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 
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Query Match 100.0%; Score 2070; DB 11; Length 20-70; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2 070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I 
Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | 
Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

Qy 181 GT GGAGAT G GT GGACAAC CT GAGGGGCAAGT C GG GGC AGGGCT ACT AC GTGGAGAT GAC C 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 181 GT GGAGAT GGT GGACAAC CT GAG GGGCAAGTCGGGGCAGGGCTACTAC GTGGAGAT GAC C 240 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I M I I I I M I I I I I I II I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II 
Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I 
Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I II I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCTUVCATT 480 

I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 


Qy 


481 


GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I 


Db 4 81 GCT GC C AT C AC T GAAT C AGACAAGT T CT T CAT CAAC GGC T C CAACT GGGAAG GCAT CC T G 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 C C C C T CAAC CAGT CT GAAGT GCTGGCCTCTGT C GGAGGGAGCAT GAT CAT T GGAGGT AT C 72 0 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 661 CCCCTCAACCAGTCTGAAGT GCT GGCCTCTGTCGGAGGGAGCAT GAT CATT GGAGGT AT C 72 0 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 78 0 

Qy 781 GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAG GAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 781 GAGGT CAT CATT GT GCGGGT GGAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

Qy 841 TACAACT AT GACAAGAGC AT T GT GGACAGT G GC AC C AC CAAC CT T CGTT T GC C CAAGAAA 900 

I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 841 T ACAAC TAT GACAAGAGC AT T GT GGACAGT GGC AC C AC CAAC CTT CGT T T GC C CAAGAAA 90 0 

Qy 901 GT GT T T GAAGCT GCAGT CAAAT C CAT CAAGGC AGC CT C CT C C AC GGAGAAGT T C CCT GAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 901 GT GT T T GAAGCT G CAGT CAAAT C CAT CAAGGC AGC CT C C T C C AC GGAGAAGT T C CCT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 T T C C CAGT CAT CT CACT CT AC C T AATGGGT GAG GT T AC CAAC CAGT C CTT C C GC AT C AC C 1080 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

Qy 1081 AT C CTT CC G CAGCAAT AC C T GC GGC CAGT G GAAGAT GT GGC C AC GT C C CAAGAC GACT GT 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 

Qy 1141 T AC AAGTT T GC CAT CT C AC AGT C AT CCAC G GGC ACT GT TAT GGGAGCT GT TAT CAT GGAG 1200 

I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1141 T ACAAGTT T GC CAT CT C AC AGT CAT CCAC GGGC ACT GT TAT GG GAGCT GT TAT CAT GGAG 1200 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I II I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

Qy 1321 GAAGACTGT GGCTACAACATT CCACAGACAGAT GAGTCAACCCT CAT GACCATAGCCTAT 1380 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1321 GAAGACTGTGGCTACAACATT CCACAGACAGAT GAGTCAACCCT CAT GACCATAGCCTAT 1380 


Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

Qy 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 1560 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1501 AAGTGAGGAGGC CCATGGGCAGAAGATAGAGATT CC CCTGGAC CACACCT CCGTGGTTCA 1560 

Qy 1561 CTTTGGT CACAAGTAGGAGACACAGATGGCACCT GT GGCCAGAGCACCT CAGGACCCT CC 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1561 CTTTGGT CACAAGTAGGAGACACAGATGGCAC CT GT GGCCAGAGCACCTCAGGACCCT CC 1620 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I II I M I I I I I I I I I I I I 
Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1680 

Qy 1681 GGACT GT ACCT GTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCT GCT GGCGGGAATACT 1740 

I I II I I I I II I I I I I I I I I II I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I II I I I I 
Db 1681 GGACT GT AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAGC ACT CT GCT GGC GGGAAT ACT 17 40 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 1 I I 
Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1800 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

I II I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 1801 TTGTCCACCATTCCTTTAAATTCTCCT^CCCTWVGTATTCTTCTTTTCTTAGTTTCAGAA 1860 

Qy 18 61 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1980 

Qy 1981 T GCT T T AGAGAC AGGGACT GT AT AAACAAGC CTAACATT GGT GCAAAGATT GCCT C TT GA 2040 

I I I I I I I I I I I I I I I II I I I I I I II M M I I I II I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1981 T GCT TT AGAGAC AGGGACT GT AT AAACAAGC CTAACATT GGT GCAAAGATT G C CT C T T GA 2040 

Qy 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 2041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2070 
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; FILE REFERENCE: 28341/6280A 

; CURRENT APPLICATION NUMBER: US/09/54 8,366 

; CURRENT FILING DATE: 2000-04-12 

PRIOR APPLICATION NUMBER: 60/155,493 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: 09/404,133 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/208 81 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS: 65 
; SOFTWARE: PatentlnVer. 2.0 
; SEQ ID NO 3 

LENGTH: 2 07 0 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-548-366-3 


Query Match 100.0%; Score 2070; DB 11; Length 2070; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2070; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 


Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I II I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 


Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 


Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

Qy 181 GT GGAGAT GGT GGACAAC C TGAGGG GCAAGT CG GGGCAGG GCT ACT AC GT GGAGAT GAC C 240 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I 
Db 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 


Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 


Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 3 60 


Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 361 T AC C GGGAC CT C C G GAAG G GT GT GT AT GT G C CCT AC AC C CAG GGCAAGT GG GAAGGG GAG 420 


Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAAGATT 480 

Qy 4 81 GCT GC CATCACT GAAT CAGACAAGT T CTT C AT CAAC G GCT C CAAC T GGGAAGGCAT CC T G 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4'81 GCT G C CAT CACT GAAT CAGACAAGT T CTT CAT CAAC GGC T C CAACT G GGAAG GCAT CCT G 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I i I I I I I I I 
Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 661 C C C CT CAAC C AGT C T GAAGT GCT G GC CT CT GT C GGAGGGAGC AT GAT CAT T G GAGGT AT C 72 0 

Qy 721 GACCACTCGCT GTACACAGGCAGT CT CT GGT ATACACCCAT CCGGCGGGAGT GGTATT AT 78 0 

I II II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 1 I I I I M I I I I I I I I I I I 
Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 78 0 

Qy 781 GAGGT CAT CAT T GT GC GGGT G GAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAG GAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT GGAC AGGAT CT GAAAAT GGACT GCAAGGAG 84 0 

Qy 841 T ACAACT AT GACAAGAGC AT T GT GGAC AGT G GC AC CAC CAAC CT T C GT T T G C C CAAGAAA 900 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II II I I I I I I I I I I I I I I I I 
Db 841 T ACAACT AT GACAAGAGC AT T GT GGACAGT GGCAC CAC CAAC CT T CGT T T GC C CAAGAAA 900 

Qy 901 GT GT T T GAAGCT GCAGT CAAAT C CAT CAAGGCAG C CT C CT C CAC GGAGAAGT T C C CT GAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 901 GT GT TT GAAG CT GCAGT CAAAT C CAT CAAG GCAGC CT C CT C CAC G GAGAAGT T C C CT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I II I I I I I I 
Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 T T C C CAGT CAT CT CACT CT AC CTAAT GGGT GAGGT T AC CAAC C AGTCCT T C C GCAT CAC C 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1021 T T C C CAGT CAT CT CACT CT AC CTAAT GGGT GAGGT T AC CAAC C AGTCCT T C C G CAT CAC C 1080 

Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I II I I I 
Db 1081 AT C CTT CC GC AG CAAT AC CT GC GGC CAGT G GAAGAT GT GGC CAC GTCC CAAGAC GACT GT 1140 

Qy 1141 T ACAAGTT T GC CAT CT CAC AGT C AT CCAC GG GCACT GTT AT G G GAGC T GT TAT CAT GGAG 12 00 

I I I I I I I I I I I I I I II I I I I I I M I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1141 T ACAAGTT T GC CAT CT CAC AGT CAT CCAC GG GCACT GT TAT GG GAGCT GT TAT CAT GGAG 1200 

Qy 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 


Qy 1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1261 CAT GT GCAC GAT GAGT T C AG GAC GGCAGC GGT GGAAGGC C CT TTT GT CAC CT T GGAC AT G 1320 

Qy 1321 GAAGACT GT GGC T ACAACATT C CACAGAC AGAT GAGT CAAC C CT CAT GAC CAT AGC CT AT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1321 GAAGACT GT GGCTACAAC AT T C CAC AGAC AGAT GAGT CAAC C CT CAT GAC CAT AG C CT AT 1380 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 14 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 144 0 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 15 00 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 15 00 

Qy 1501 AAGT GAGGAGGC C CAT G GGCAGAAGAT AGAGAT T C C CCT GGAC CAC ACCT C C GT G GT T CA 1560 

I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1501 AAGT GAGGAGGCCCAT GGGCAGAAGATAGAGATT CC CCT GGACCACACCT CCGTGGTT CA 1560 

Qy 1561 CT T T G GT C AC AAGT AGGAGACAC AGAT GGC AC C T GT GGC C AGAGCAC CT C AGGAC C CT C C 1620 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1561 CT T T GGT C ACAAGT AGGAGACACAGAT GGC AC CT GT GGC CAGAGC AC CT C AG GAC C C T C C 162 0 

Qy 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 

I I I I I I I I I I I I I I I I I I I I I I i II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 1621 CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 

Qy 1681 G GAC T GT AC CT GTAGGAAACAGAAAAGAGAAGAAAGAAGC ACT CT GC T GGC GG GAAT ACT 17 4 0 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II 
Db 1681 G GAC T GT ACCT GTAGGAAACAGAAAAGAGAAGAAAGAAGC ACT CT GC T GGC GGGAAT ACT 174 0 

Qy 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 1741 CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 

Qy 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1801 TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

Qy 1861 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 192 0 

I I I I I II I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1861 GT ACT GGC AT CAC AC GCAGGT T AC CT T GGC GT GT GT CC CT GT GGT AC CC T GG C AGAGAAG 1920 

Qy 1921 AGAC CAAGCTT GT TT CC CT GCT GGC CAAAGT C AGT AGGAGAGGAT GCACAGT TT G CT ATT 198 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I 
Db 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 198 0 

Qy 1981 T GCT T T AGAGACAGG GACT GT AT AAACAAGCCT AAC AT T GGT GCAAAGAT TGCCTCTT GA 2 04 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I 
Db 1981 T GCT T T AGAGACAGG GACT GT AT AAACAAGCCT AAC AT T GGT GCAAAGAT TGCCTCTT GA 2 04 0 

Qy 2 041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 207 0 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 2 041 ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2 07 0 


RESULT 


9 


US-09-969-671A-1 

; Sequence 1, Application US/09969671A 

; Publication No. US20030036112A1 

; GENERAL INFORMATION: 

; APPLICANT: CHAPMAN, CONRAD G. 

; APPLICANT: MURPHY, KAY 

; APPLICANT: POWELL, DAVID J. 

; APPLICANT: SMITH, TRUDI S. 

; TITLE OF INVENTION: ASP2 

; FILE REFERENCE: GH-70368-D1 

; CURRENT APPLICATION NUMBER: US/09/969, 671A 

; CURRENT FILING DATE: 2001-10-03 

; PRIOR APPLICATION NUMBER: UK 9701684.4 

; PRIOR FILING DATE: 1997-01-28 

PRIOR APPLICATION NUMBER: 09/009,191 
; PRIOR FILING DATE: 1998-01-20 
; PRIOR APPLICATION NUMBER: 09/694,200 
; PRIOR FILING DATE: 2000-10-23 
; NUMBER OF SEQ ID NOS : 6 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 1 

LENGTH: 2 541 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 

NAME/KEY: unknown 

LOCATION: (2455) (2456) (2463) (2478) (2480) (2497) (2507) (2509) (2512) (2516) 
; LOCATION: (2520) (2522) (2525) (2529) (2539) (2540) 

OTHER INFORMATION: Wherein n can be represented by a, c, t, or g 
US-09-969-671A-1 


Query Match 98.2%; Score 2032; DB 11; Length 2541; 

Best Local Similarity 99.5%; Pred. No. 0; 

Matches 2049; Conservative 0; Mismatches 10; Indels 1; Gaps 1; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I M I I I I I I I I I I I I I I M II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 


Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

II I I I II I I I I I I I I I II I I I I I I I I I I I II I I I I I I I II I II I I I I II I I I I I I I I I I I 
Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 


Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 


Qy 181 GT GGAGAT G GTGGACAACCT GAGGGGCAAGT C GGG GCAG GGCT ACT AC GT G GAGAT GAC C 24 0 

I I I I II I I I I I I I I II I I I I I II I I I I I I I I I I I I I II I I II I I I I I M I I I I I I I I I I I 

Db 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 


Qy 241 GTGGGCAGCCCCCCGCAGACGCTCJ^ACATCCTGGTGGATACAGGCAGCAGTTUVCTTTGCA 30 0 

I I I I M I I I I I I I I II II I I I I I I I I I II I I I I I I I II I I I I II I I I II I I I I I I I I II I 
Db 241 GT GGGCAGC C CC C C G CAGAC G CT CAAC AT C CT GGT GGATAC AG GC AGC AGTAACT T T GCA 300 


Qy 


301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 


Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGAGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

Qy 4 81 GCT GC CAT C AC T GAAT C AGACAAGT TCT T CAT CAACGGCT C CAACT G GGAAGGCATC C T G 54 0 

I I I I I I I i II I II I i I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I i I I I I I I I I II 

Db 4 81 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 660 

Qy 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I ! I I I I I I I I I I I 
Db 661 C C C C T CAAC CAGT CT GAAGT GCT G GC C T CT GT C GGAGGGAGC AT GAT C ATT GGAGGT AT C 72 0 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I | | | | 
Db 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 780 

Qy 7 81 GAGGT CAT CATT GT GC GGGT GGAGAT CAAT GGACAGGAT CTGAAAAT GGACT GCAAGGAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 GAGGT GAT CAT T GT GC G GGT G GAGAT CAAT GGACAGGAT CT GAAAAT GGACT GCAAGGAG 840 

Qy 841 T ACAACT AT GACAAG AG CATT GT GGACAGT GGCAC CAC CAAC CT T CGTT T GCC CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I 

Db 841 TACAACTAT GACAAGAGCATTGT GGACAGT GGCAC CACCAACCTT CGTTT GCCCAAGAAA 900 

Qy 901 GT GT TT GAAGCT G CAGT CAAAT C CAT CAAG GC AGC CT C CT CC AC GGAGAAGT T C C CT GAT 960 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 GTGTTT GAAGCT GCAGT CAAAT CCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 1020 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC7\ACCAGTCCTTCCGCATCACC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 T T C C CAGT CAT C T CACT CT AC C TAAT GGGT GAGGT T AC CAAC CAGT C CT T CC GCAT CAC C 1080 

Qy 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 114 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 


Qy 

Db 


1141 
1141 


T ACAAGT TT GC C AT CT C ACAGT CAT C C AC GGGCACT GT T AT GGGAGC T GT TAT CAT GGAG 
I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I | | | I I | I | | | I I I I I I I | | | | | | | | I I I I 
T ACAAGT T T GC CAT CT C AC AGT CAT C CAC G GGCACT GT TAT GGGAGC T GT TAT CAT GGAG 


1200 
1200 


Qy 12 01 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I H I I I I I I II I I I I 

Db 1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

Qy 12 61 CAT GT GCAC GAT GAGT T C AGGAC GGC AG C GGTGGAAGGCC C T T TTGT CAC CT T G GACAT G 1320 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 61 CAT GT GCAC GAT GAGT T C AGGAC GGC AG C GGTGGAAGGCC CT TT T GT CAC CT T GGACAT G 1320 

Qy 1321 GAAGACT GT GGC T ACAAC AT T C C AC AGAC AGAT GAGT CAAC C CT C AT GAC CAT AG C CT AT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 1321 GAAGACT GT GG CT ACAAC AT T C C ACAGACAGAT GAGT CAAC C CT CAT GAC CAT AGC CT AT 1380 

Qy 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

Qy 1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 14 41 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

Qy 1501 AAGT GAGGAGGC C CAT GGGC AGAAG ATAGAGAT T C C C CT - GGAC CAC AC CT C C GT GGT T C 1559 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1501 AAGT GAGGAGGC C CAT GGGAGAAAGAT AGAG AT T C C C CT GGGAC CAC AC CT C C GT GGTT C 1560 

Qy 1560 ACT T T GGT C ACAAGTAGGAGACAC AGAT GGC AC CT GT GGC CAGAGC AC CT CAGGAC CCT C 1619 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1561 ACT T T G GT C ACAAGTAGGAGACAC AGAT G GC AC CT GT GGC CAGAGC AC CT CAGGAC CCT C 1620 

Qy 1620 CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCA 1679 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1621 CCCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCA 1680 

Qy 1680 GGGACT GT AC CT GTAGGAAACAGAAAAGAGAAGAAAGAAGCACT CTGCT GGCGGGAAT AC 1739 

I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I 
Db 1681 GGGACT GT AC CT GT AGGAAACAGAAAAGAGAAGAAAGAAGCACT CT G CT GGC G GGAAT AC 1740 

Qy 1740 TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACC 1799 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 17 41 TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACC 1800 

Qy 18 00 TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 1859 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 18 01 TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 1860 

Qy 18 60 AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAA 1919 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 18 61 AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCGGGCAGAGAA 1920 

Qy 1920 GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTAT 1979 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1921 GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTAT 1980 

Qy 1980 TT G CT T TAGAGACAGGGACT GT ATAAACAAGC CTAACATT GGT GCAAAGATT GC CT CT T G 2 039 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 1981 TT G CT T TAGAGACAGGGACT GT AT AAACAAG C CT AAC AT T GGT GCAAAGATT GC CT CT T G 2 040 


Qy 2040 AATTAAAAAAAAAAAAAAAA 2059 

I I I I I I I I I I I I I I I I I 
Db 2041 AAT T AAAAAAAAAAAC TAG A 2060 


RESULT 10 
US-10-308-365-1 

; Sequence 1, Application US/10308365 

; Publication No. US20030109022A1 

; GENERAL INFORMATION: 

; APPLICANT: CHAPMAN, CONRAD G. 

; APPLICANT: MURPHY, KAY 

; APPLICANT: POWELL, DAVID J. 

; APPLICANT : SMITH, TRUDI S. 

; TITLE OF INVENTION: ASP 2 

; FILE REFERENCE: GH-70368-2 

; CURRENT APPLICATION NUMBER: US/10/308,365 
; CURRENT FILING DATE: 2002-12-03 
; PRIOR APPLICATION NUMBER: US/09/694, 200 
; PRIOR FILING DATE: 2000-10-23 

PRIOR APPLICATION NUMBER: UK 9701684.4 

PRIOR FILING DATE: 1997-01-28 

PRIOR APPLICATION NUMBER: 09/009,191 

PRIOR FILING DATE: 1998-01-20 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 1 

LENGTH: 2541 
TYPE: DNA 

ORGANISM: HOMO SAPIENS 
; FEATURE: FEATURE: MISCELLANEOUS FEATURE 
; NAME/ KEY: UNSURE 

LOCATION: (2455) (2456) (2463) (2478) (2480) (2497) (2507) (2509) (2512) (2516) (2520) 

LOCATION: (2522) (2525) (2529) (2539) (2540) 

OTHER INFORMATION: OTHER INFORMATION: n= a, g, c or t 
US-10-308-365-1 


Query Match 98.2%; Score 2032; DB 15; Length 2541; 

Best Local Similarity 99.5%; Pred. No. 0; 

Matches 2049; Conservative 0; Mismatches 10; Indels 1; Gaps 1; 

Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 


Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 

M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 12 0 


Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 


Qy 181 GT GGAGAT GGT GGACAAC CT GAG GGGCAAGT C GG GG CAGGGCT ACT AC GT GGAGAT GAC C 24 0 

I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 181 GT GGAGAT GGT GGACAAC CT GAG GGGCAAGT CGGGGCAGGGCT ACT ACGT GGAGAT GACC 24 0 


Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 30 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 30 0 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGAGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 48 0 

I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 48 0 

Qy 4 81 GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCC7^CTGGGAAGGCATCCTG 54 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 GCT GC C AT CACTGAAT C AGACAAGT T CT T CAT CAAC GGCTC CAACT GGGAAGGC AT C CT G 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I II I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 660 

Qy 661 C C C CT CAAC CAGT C T GAAGT GC T GGC CT CT GT C G GAGGGAGCAT GAT CAT T GGAGGT AT C 72 0 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 72 0 

Qy 721 GAC C AC T C GCT GT ACAC AGGC AGT CT CT GGT AT AC AC C CAT C C GGC GGGAGT GGT AT TAT 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 721 GAC C ACT C GCT GT ACAC AGGC AGT CTCT GGT AT ACAC C CAT CC GGC GGGAGT GGT AT TAT 78 0 

Qy 781 GAGGT CAT CATTGT GCGGGT GGAGATCAAT GGACAGGAT CT GAAAATGGACT GCAAGGAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 GAGGT GAT CATT GT GC G GGT GGAGAT CAAT G GAC AGGAT CT GAAAAT GGACT GCAAG GAG 84 0 

Qy 841 T ACAACT AT GACAAGAG CAT T GT GGACAGT GGC AC CAC CAAC CT T C GT T T GC CCAAGAAA 90 0 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 841 T ACAACT AT GACAAGAG CAT T GT GGACAGT G GC AC CAC CAAC CT T C GTT T G C C CAAGAAA 90 0 

Qy 901 GT GTTT GAAGCT GCAGT CAAAT C CAT CAAGGC AG C CT C CT C CAC GGAGAAGT T CC CT GAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I II I I I I I 

Db 901 GT GTT T GAAGCT GCAGT CAAAT C CAT CAAGG C AG C CT C CTC CAC GGAGAAGT T CC CT GAT 960 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

I I I I II I I I I I I I I I II I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I 

Db 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 102 0 

Qy 1021 TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 10 8 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 T T C CC AGT CAT CT CACT CTAC CTAAT GGGT GAGGT T AC CAAC C AGT CCT T C C GC AT CAC C 10 8 0 

Qy 1081 AT C CT T C C GC AGCAAT AC CT GC GGC CAGT GGAAGAT GT GGC CAC GT CC CAAGACGACT GT 114 0 


1081 ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 1140 

1141 T ACAAGTTT GC CAT CT C ACAGT CAT C CAC GG GCAC T GT TAT G GGAGCT GT TAT CATGGAG 1200 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1141 T ACAAGTTT GC CAT CTCACAGT CAT CCACGGGCACTGTTAT GGGAGCT GTTAT CATGGAG 1200 

1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

1201 GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 12 60 

1261 CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

12 61 CAT GT G CAC GAT GAGTT C AGGAC GGCAG C GGT GGAAGG C C C TTT T GT C AC CT T G GACAT G 1320 

1321 GAAGACTGTGGCTACAACATTCCACAGACAGAT GAGTCAACCCT CAT GACCATAGCCTAT 1380 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

1321 GAAGACTGTGGCTACAACATTCCACAGACAGAT GAGTCAACCCT CAT GACCATAGCCTAT 1380 

1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1381 GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

1441 CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 1500 

1501 AAGT GAGGAG GC C CAT GGGC AGAAGAT AGAGATT C CC CT - GGAC C AC AC CT C C GT GGT T C 1559 

I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I 

1501 AAGT GAGGAGGC C CAT GGGAGAAAGAT AGAGAT T C C C CT GGGAC C AC AC CT C C GTGGT T C 1560 

1560 ACTTT GGT CACAAGTAGGAGACACAGATGGCACCT GT GGCCAGAGCACCT CAGGACC CT C 1619 
I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I 

1561 ACTTT GGTCACAAGTAGGAGACACAGAT GGCACCT GT GGCCAGAGCACCT CAGGACCCT C 1620 

162 0 C C CAC C CAC CAAAT G CCT CT G C CT T GAT GGAGAAGGAAAAGGCT GGCAAGGT GG GTT C CA 1679 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1621 C C CAC C CAC CAAAT GCCT CT GCCTT GAT GGAGAAGGAAAAGGCT GGCAAGGT GGGTT CCA 1680 

1680 GG GAC T GTAC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAGC ACT CT GCT GGC GGGAAT AC 1739 
I I I I I I I I I I I II I I I I I I II I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1681 GGGACT GTAC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAGC ACT CTGCTGGC GG GAAT AC 17 40 

1740 TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACC 17 99 

I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

1741 TCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACC 1800 

18 00 TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 1859 

I I I II I II I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I 

18 01 TTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGA 18 60 

1860 AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAA 1919 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

1861 AGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCGGGCAGAGAA 1920 

1920 GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTAT 1979 
I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I 


Db 1921 


GAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTAT 198 0 


Qy 1980 T T GCTT T AGAGAC AGGGACT GT AT AAACAAGC CT AACAT T GGT G CAAAGATT GC CT CT T G 2039 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1981 T T GCTT T AGAGAC AGGGACT GT AT AAACAAGC CTAAC AT T GGT G CAAAGAT TGCCTCTTG 204 0 

Qy 2040 AATTAAAAAAAAAAAAAAAA 2059 

I I I I I I I I I I I I I I I I I 
Db 2041 AATTAAAAAAAAAAACTAGA 2060 


RESULT 11 
US-09-796-264-1 

; Sequence 1, Application US/09796264 

; Patent No. US2002004 9303A1 

; GENERAL INFORMATION: 

; APPLICANT: Tang, Jordan J.N. 

; APPLICANT: Lin, Xinli 

; APPLICANT: Koelsch, Gerald 

TITLE OF INVENTION: Catalytically Active Recombinant Memapsin and Methods 
; TITLE OF INVENTION: of Use Thereof 
; FILE REFERENCE: OMRF 179 

; CURRENT APPLICATION NUMBER: US/09/796,264 

; CURRENT FILING DATE: 2001-02-28 

; PRIOR APPLICATION NUMBER: 09/604,608 

; PRIOR FILING DATE: 2000-06-27 

PRIOR APPLICATION NUMBER: 60/168,060 

PRIOR FILING DATE: 1999-11-30 

PRIOR APPLICATION NUMBER: 60/177,836 

PRIOR FILING DATE: 2000-01-25 
; PRIOR APPLICATION NUMBER: 60/178,368 
; PRIOR FILING DATE: 2000-01-27 
; PRIOR APPLICATION NUMBER: 60/210,292 
; PRIOR FILING DATE: 2000-06-08 
; NUMBER OF SEQ ID NOS : 31 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 1 

LENGTH: 3252 
; TYPE: DNA 

; ORGANISM: Homo sapiens 
US-09-796-264-1 

Query Match 97.1%; Score 2010; DB 9; Length 3252; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2013; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 

Qy 40 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 99 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 60 

Qy 100 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 159 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 61 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 120 


Qy 

Db 


160 
121 


219 


180 


Qy 220 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 27 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 24 0 

Qy 280 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 339 

I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 30 0 

Qy 340 CAGAGGCAGCT GT C C AGC ACAT AC C GGGAC CT C C G GAAG GGT GT GT AT GT GC C CT AC ACC 39 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 301 CAGAGGCAGCT GT C CAGC ACAT AC C GGGAC C T C C GGAAG GGT GT GT AT GT GC C CT AC AC C 360 

Qy 400 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 45 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 42 0 

Qy 4 60 GT CACT GT GC GT G C CAAC AT T G CT GC C AT C ACT GAATC AGACAAGTT CTT CAT CAACGG C 519 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

Db 421 GT CACT GT GC GT G C CAAC AT T GCT G C CAT CACT GAAT C AGACAAGTT CTT CAT CAACGGC 480 

Qy 520 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 57 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I! II I I I I I I I I I I I I I I I I I I 

Db 4 81 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 54 0 

Qy 580 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 639 

! I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I II II I I I M I I I I I I I I I I I I 

Db 541 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 600 

Qy 640 CACCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 699 

II I M I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 601 CAGCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 660 

Qy 700 AGCAT GATCATTGGAGGTAT CGACCACTCGCT GTACACAGGCAGT CTCT GGT AT ACAC CC 759 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 AGCAT GATCATT GGAGGT AT C GAC CACT C GCT GT AC ACAG GC AGT CTCT GGT AT ACAC C C 72 0 

Qy 760 AT CCGGCGGGAGT GGT ATT AT GAGGT CAT CATT GTGCGGGT GGAGATCAAT GGACAGGAT 819 

I I I M I I I I I I II I I II I II I I I I I I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

Db 721 AT CCGGCGGGAGTGGT ATT AT GAGGT GAT CATT GT GCGGGT GGAGATCAATGGACAGGAT 780 

Qy 820 CT GAAAATGGACT GCAAGGAGTACAACTAT GACAAGAGCATT GT GGACAGTGGCACCACC 87 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I 
Db 781 CT GAAAAT GGACT GCAAGGAGTACAACTAT GACAAGAGCATT GT GGACAGT GGCAC C AC C 84 0 

Qy 880 AACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCC 939 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I II I I I I I I I 

Db 841 AACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCC 900 

Qy 940 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 999 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 901 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 960 

Qy 1000 G GCAC C AC CC CT T GGAAC AT T T T C C C AGT CAT CT C ACT CT AC CT AATGGGT GAGGTT AC C 1059 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I II 
Db 9 61 GG CAC C AC CC CTT GGAACAT T T T C C CAGT CAT CT C ACT C T AC CT AATGGGT GAGGT T AC C 102 0 


Qy 1060 AAC CAGT C CT T C C GCAT C AC CAT C CT T CC GC AGCAAT ACCT GC GGC C AGT GGAAGAT GT G 1119 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 AAC CAGT C CT T C C GCAT C AC CAT C CT T CC G C AGCAAT ACCT GC G G C CAGT GGAAGAT GT G 1080 

Qy 112 0 G C C ACGT C C CAAGACGACT GT T ACAAGTT T GC CAT C T CAC AGT CAT C C AC GGGCACT GTT 1179 

I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 1081 G C C ACGT C C CAAGACGACT GT T ACAAGTT T GC CAT C T CAC AGT CAT C CAC GGGCACT GTT 1140 

Qy 118 0 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1239 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1200 

Qy 12 4 0 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 12 99 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I 

Db 12 01 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 1260 

Qy 1300 C CTTTT GT CACCTT GGACAT GGAAGACTGTGGCTACAACATT CCACAGACAGAT GAGT CA 1359 

I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 

Db 1261 C CTTT T GT CAC CT T GGACAT GGAAGACT GT G GCT ACAAC AT T C C AC AGAC AGAT GAGT C A 1320 

Qy 1360 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1419 

I I I I I I t I I ! I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1321 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1380 

Qy 142 0 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 1479 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 1381 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 14 40 

Qy 14 8 0 GCT GAT GAC AT C T C CCT GCT GAAGT GAGGAGGC C CAT GGGCAGAAGAT AGAGAT T C C C CT 1539 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 14 41 GCT GAT GACAT CT CCCT GCT GAAGT GAGGAGGCC CAT GGGCAGAAGATAGAGATT CCCCT 1500 

Qy 154 0 GGACC AC AC CT C C GT GGT T CACT T T GGT C ACAAGT AG GAGAC AC AG AT GGCAC CT GT GGC 1599 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 1501 GGACC AC AC C T C C GT GGT T CACT TT G GT C ACAAGT AGGAGAC AC AGAT GGCAC CT GT GGC 1560 

Qy 1600 CAGAGCAC CT CAGGACC CT C C C CAC C CAC CAAAT GC CT CT GC CT T GAT G GAGAAGGAAAA 1659 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1561 CAGAGCAC CT CAGGACCCT C C C CAC C CAC CAAAT GC CT CTGC CT T GAT GGAGAAGGAAAA 1620 

Qy 1660 GGCT GGCAAGGT GGGTT C CAGGGACT GTACCT GT AGGAAACAGAAAAGAGAAGAAAGAAG 1719 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 G GCT GGCAAGGT GGGT T C CAGGGACT GTAC CT GT AGGAAACAGAAAAGAGAAGAAAGAAG 1680 

Qy 172 0 CACT CT GCT GGC GGGAATACT CTT GGT CACCT CAAATTTAAGT CGGGAAATT CT GCT GCT 1779 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 1681 CACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCT 1740 

Qy 178 0 TGAAACTTCAGCCCTGAAC CTTT GTCCACCATTCCTTTAAATTCTCCAACCCAAAGT ATT 1839 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I J I I I I I I I I I I I I I I I I I I I I I I I 
Db 1741 T GAAACTT CAGC C CT GAAC CTTT GT C CAC CAT T C CT T T AAAT T CT C CAAC CCAAAGT ATT 1800 

Qy 184 0 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 99 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 18 01 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 60 

Qy 1900 TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1959 


Db 1861 T GT GGT AC C CT G GCAGAGAAGAGAC C AAGC TT GT T T C C CTGCT GGC CAAAGT C AGT AGGA 1920 

Qy 1960 GAG GAT GCAC AGT T T GCT AT T T GCT T T AGAGAC AGGGACTGT ATAAACAAGCCT AACATT 2019 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1921 GAGGAT GCAC AGT T T GCT AT T T GCT TTAGAGAC AGGGACT GT ATAAACAAGCCTAACAT T 1980 

Qy 2 02 0 GGT GCAAAGAT TGCCTCTT GAAT TAAAAAAAAAAAAAA 2057 

I I I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1981 GGT GCAAAGAT TGCCTCTT GAAT TAAAAAAAAACT AGA 2018 
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Query Match 97.1%; Score 2010; DB 10; Length 3252; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2013; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 

Qy 4 0 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 99 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 60 

Qy 100 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 159 

I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 120 

Qy 160 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 219 

I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 180 


220 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 27 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I 
181 G G C TAC T AC GT GGAGAT GAC CGT G G GCAGC C C C C C GC AGAC GC T CAAC AT C CT GGT GGAT 240 

280 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 339 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

241 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 300 

340 CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 399 

I M I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

301 C AGAGGCAGCT GT C C AGC AC AT AC C G GGAC CT C C G GAAGGGT GT GT AT GT GCC C T ACAC C 360 

4 00 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 459 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 420 

4 60 GT CACT GT GC GT GCCAAC ATTG C T GC CAT C ACT GAAT C AGACAAGT T CT T CAT CAAC GGC 519 

I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I i I I I I I I I I I I | I | | | | | | | | | | | 
421 GT C ACT GT G C GT G CCAAC ATT GCT GC C AT CACT GAAT C AGACAAGT T C T T CAT C AACGGC 480 

52 0 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 57 9 

I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I | I I I I 
4 81 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 54 0 

580 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 639 

I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I 

541 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 600 

64 0 CACCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 699 

II I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I 

601 CAGCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 660 

7 00 AG CAT GAT CAT T G GAGGT ATC GAC C ACT C GCT GT AC AC AG GCAGT CT C T GGT AT ACAC C C 759 

M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I II I 
661 AG CAT GAT CAT T G GAG GT AT C GAC CACT C GCT GT ACAC AG GCAGT CT CT GGT AT AC AC C C 720 

7 60 AT C C G GCGG GAGT GGT AT TAT GAGGT CAT C ATT GT GC GGGT GGAGAT CAATGGACAGGAT 819 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M [ I I I II I I I 
721 AT CCGGCGGGAGT GGT ATT AT GAGGTGAT CATT GT GCGGGTGGAGAT CAATGGACAGGAT 780 

820 CT GAAAAT GGACT GCAAG GAGT ACAAC TAT GACAAGAGCAT T GT GGAC AGT GGC AC CAC C 879 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 81 CT GAAAAT GGACT GCAAG GAGT ACAAC TAT GACAAGAGCAT T GT GGAC AGT GGCAC CAC C 840 

880 AACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCC 939 

I I I I M I I I I I I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
841 AAC CTT CGT T T GC C CAAGAAAGT GT TT GAAGCT GCAGT CAAAT C CAT CAAGGC AG C CT C C 900 

94 0 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 999 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
901 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 960 

1000 GG CAC CAC C C CT T GGAAC ATT T T C C CAGT C AT CT C ACT C TAC C T AAT G GGT GAGGT TAC C 1059 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I 
961 GG CAC CAC C C CT T GGAACATT T T C C CAGT CAT CT C ACT C T AC CT AAT G GGT GAGGT TAC C 1020 


Qy 1060 AACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTG 1119 

I I I II I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 AAC C AGT C CT T C C GC AT CAC CAT C C T T C C GCAG CAAT AC CTGC G GC CAGT GGAAGAT GT G 1080 

Qy 1120 GC CAC GT C C CAAGAC GACT GTT ACAAGTTT G C CAT CT C ACAGT CAT C CAC G GGC ACT GT T 1179 

I I I I I II I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1081 GC CACGT C C CAAGAC GACT GT T ACAAGT T T GC CAT CT CAC AGT CAT C CAC GGGCAC T GT T 114 0 

Qy 1180 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1239 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I N I I I I I I I I 
Db 1141 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1200 

Qy 1240 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 1299 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 

Db 1201 GGCTTTGCT GT CAG C G CT T GC CAT GT GCAC GATGAGT T CAGGAC G GC AGC G GT G GAAGGC 1260 

Qy 1300 CCTTTT GT CACCTT GGACATGGAAGACTGTGGCT ACAACATT C CACAGACAGAT GAGT CA 1359 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 61 C CTT TT GT CAC CT T GGAC AT GGAAGACT GT GGCT ACAACATT C CACAGACAGAT GAGT CA 132 0 

Qy 1360 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1419 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 1321 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1380 

Qy 1420 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 1479 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I ! I I I 

Db 1381 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 144 0 

Qy 1480 GCT GAT GACAT CT C C C T GCT GAAGT GAGGAGG CC C AT G G GCAGAAGAT AGAGATT C C C C T 1539 

I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1441 GCT GAT GACAT CT CCCT GCT GAAGT GAGGAGGCCCATGGGCAGAAGATAGAGATTC CCCT 1500 

Qy 1540 GGACCAC AC CT CC GT GGT T C ACT TT GGT CACAAGT AGGAGACAC AGAT GGC AC CT GT GGC 1599 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I 

Db 1501 GGACCAC AC CT CC GT GGTT C ACT T T GGT CACAAGT AGGAGACAC AGAT GGC AC CT GT GGC 1560 

Qy 1600 C AGAGC AC CT CAG GAC C CT C C C CAC CCAC CAAAT GC C TCTGCCTT GAT GGAGAAGGAAAA 1659 

I I I I I I I I I I I I I II I I II I I I I I M I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1561 C AGAG CAC CT CAGGAC C CT C C C CAC CCAC CAAAT GCCTCTGCCTT GAT GGAGAAGGAAAA 162 0 

Qy 1660 GGCTGGCAAGGTGGGTTCCAGGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAG 1719 

I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 1621 GGCTGGCAAGGTGGGTTCCAGGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAG 168 0 

Qy 1720 CACT CT GCT GGC GGGAATACT CTT GGT CAC CT CAAAT TTAAGTCGGGAAATTCT GCT GCT 177 9 

M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 CACT CT GCT GGC G G GAATACT CTT GGT CAC CT CAAAT TTAAGTCGGGAAATTCT GCT GCT 174 0 

Qy 1780 TG7VAACTTCAGCCCTGAACCTTTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATT 1839 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 1741 T GAAACT T CAGC C C T GAAC CT T T GT CC AC C ATT C CT TT AAATT CT C CAAC C CAAAGT AT T 1800 

Qy 1840 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 1899 

I II I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I M 
Db 1801 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 1860 

Qy 1900 TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1959 


1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1861 TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1920 

Qy 1960 GAGGAT GCACAGTT T GC TATT T GC T TT AGAGAC AGGGACTGT AT AAACAAGC CT AACAT T 2019 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 1921 GAGGAT GCACAGTT T GCT AT T T GCT TTAGAGACAGGGACT GT AT AAACAAGC C T AACAT T 1980 

Qy 2 020 GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAAAAAAA 2057 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1981 GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAACTAGA 2018 


RESULT 13 
US-09-795-903A-1 

Sequence 1, Application US/09795903A 
Patent No. US2 00201647 60A1 
GENERAL INFORMATION: 
APPLICANT: Tang, Jordan J.N. 
APPLICANT: Lin, Xinli 
APPLICANT: Koelsch, Gerald 

TITLE OF INVENTION: Catalytically Active Recombinant Memapsin and Methods 
TITLE OF INVENTION: of Use Thereof 
FILE REFERENCE: OMRF 179 

CURRENT APPLICATION NUMBER: US/09/795, 903A 
CURRENT FILING DATE: 2001-02-28 
PRIOR APPLICATION NUMBER: 09/604,608 
PRIOR FILING DATE: 2000-06-27 
PRIOR APPLICATION NUMBER: 60/168,060 
PRIOR FILING DATE: 1999-11-30 
PRIOR APPLICATION NUMBER: 60/177,836 
PRIOR FILING DATE: 2000-01-25 
PRIOR APPLICATION NUMBER: 60/178,368 
PRIOR FILING DATE: 2000-01-27 
PRIOR APPLICATION NUMBER: 60/210,292 
PRIOR FILING DATE: 2000-06-08 
NUMBER OF SEQ ID NOS : 31 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 1 
LENGTH: 3252 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-795-903A-1 

Query Match 97.1%; Score 2010; DB 10; Length 3252; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2013; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 

Qy 40 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 99 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M I I I i I I I I I I I I I I I I I I I I II 

Db 1 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 60 

Qy 100 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 159 

I I I I I I I II I I I I I I I I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 12 0 


Qy 


160 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 219 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I 


Db 121 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 18 0 

Qy 220 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 279 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 240 

Qy 2 80 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 339 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 300 

Qy 34 0 CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 399 

I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 C AGAGG CAGC T GTC CAGCAC AT AC CGGGAC CT C C GGAAGGGT GT GT AT GT GC C CT AC AC C 360 

Qy 4 00 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 459 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 361 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 420 

Qy 4 60 GT C ACTGT GC GT GC CAACAT T GCT GC CAT CACT GAAT C AGACAAGT T CTT C AT CAACGG C 519 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 GT C ACT GT GC GT GC CAAC ATT GCT GC CAT CACT GAAT C AGACAAGT T CTT CAT CAACGGC 48 0 

Qy 52 0 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 57 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 

Db 4 81 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 54 0 

Qy 5 80 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 639 

II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 600 

Qy 640 CACCTTTGTGGTGCTGGCTTCCCCCT CAAC CAGTCTGAAGT GCT GGCCTCTGTCGGAGGG 699 

II II I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 CAGCTTTGTGGTGCTGGCTTCCCCCT CAAC CAGTCTGAAGT GCT GGCCTCTGTCGGAGGG 660 

Qy 7 00 AGC AT GAT C ATT GGAGGT AT C GAC CACT C G CT GT AC AC AGGC AGT CT CT GGTAT AC AC C C 759 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 661 AGCAT GAT CAT T GGAGGT AT C GAC CACT C GCT GT AC AC AGGC AGT CT CT GGTAT ACAC C C 720 

Qy 7 60 AT C C GGC GGGAGTG GT AT TAT GAGGT CAT CAT T GT GC GGGT GGAGAT CAAT G GACAGGAT 819 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 AT C C GG C GG GAGTG GT AT TAT GAGGT GAT CAT T GT GC GG GT GGAGAT CAAT GGACAGGAT 780 

Qy 82 0 CT GAAAAT GGACT GCAAGGAGT ACAACT AT GACAAGAGCAT T GT GGACAGT GGC AC C AC C 879 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 

Db 781 CT GAAAAT GGACT GCAAGGAGT ACAACT AT GACAAGAGCAT T GT GGACAGT GGC AC CAC C 840 

Qy 880 AACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCC 939 

I I I I I I I II I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 841 AAC CT T C GT TT GCC CAAGAAAGT GTTT GAAG CT G C AGT CAAAT C CAT CAAGGC AGC CT C C 900 

Qy 940 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 999 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 901 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 960 

Qy 1000 GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 1059 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 102 0 


Qy 1060 AAC CAGT C CT T CC GCAT CAC CAT C CTT C C GC AGCAATAC CT GC GGC C AGT GGAAGAT GT G 1119 

I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 1021 AAC CAGT C CTT CC GCAT CAC CAT C CTT C C GCAGCAATACCT GC GGC CAGT GGAAGAT GT G 1080 

Qy 1120 GC CAC GT C CCAAGAC GACT GTT ACAAGT T T G C CAT CT CACAGT CAT CC AC GGGCACT GT T 117 9 

I II I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I 
Db 1081 GC CACGT C C CAAGAC GACT GTT ACAAGT T T GC CAT CT CACAGT CAT CCAC GGGCAC T GT T 114 0 

Qy 1180 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1239 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I 
Db 1141 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 12 00 

Qy 124 0 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 12 99 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 1201 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 1260 

Qy 1300 C CT TT T GT CAC CT T GGAC AT GGAAGACT GT GGCT ACAAC ATT C CAC AGACAGAT GAGT C A 135 9 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I II I 

Db 1261 C CT TT T GT CAC CT T GGAC AT GGAAGACT GT GGCT ACAAC ATT C CAC AGACAGAT GAGT CA 132 0 

Qy 1360 AC C CT CAT GAC C AT AGC CT AT GT C AT GG CT GC C AT CT G C GC C CT CT T CAT GC T GC CAC T C 1419 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II M I I I I I I 
Db 1321 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1380 

Qy 1420 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 1479 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I I I I I 

Db 1381 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 14 4 0 

Qy 14 80 GCT GAT GAC AT CT C C CT GCT GAAGT GAGGAGG CC CAT GGGCAGAAGAT AGAGATT C C C CT 1539 

I I I I I I I I I I I I I I II I I I I i I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 1441 GCT GAT GACAT CT C C CT GCT GAAGT GAGGAGG CC CAT GGGCAGAAGAT AGAGATT C C C CT 1500 

Qy 1540 GGACC ACAC CT C C GT G GT TCACT T T GGT CACAAGT AGGAGACAC AGAT GGCAC CT GT GGC 1599 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I M I I I I I I I I I I I I I I I I I I I I I I 

Db 1501 GGAC CACAC CT CC GT GGT TCACTTT GGT CACAAGT AGGAGACAC AGAT GGCAC CTGT GGC 1560 

Qy 1600 C AGAGCAC CT C AGGAC C CT C C C CAC CCAC CAAAT GCCTCTGCCTT GAT GGAGAAGGAAAA 1659 

II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I 

Db 1561 C AGAGCAC CT C AGGAC C CT C C CCAC CCAC CAAAT G C CT CT GC CT T GAT GGAGAAGGAAAA 1620 

Qy 1660 GG CT GGCAAGGT GGGT T C C AGGGACT GT AC CT GT AGGAAAC AGAAAAGAGAAGAAAGAAG 1719 

I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 GGCTGGCAAGGTGGGTTCCAGGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAG 168 0 

Qy 1720 CACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCT 177 9 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 CACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCT 174 0 

Qy 1780 T GAAACT T CAGCC CT GAACCT TT GT CC AC CAT T C CT T TAAAT T CT C CAAC C CAAAGT AT T 1839 

I I II I I I I I I I I I I I I I I I E I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1741 TGAAACTTCAGCCCTGT^lACCTTTGTCCACCATTCCTTTATVATTCTCCAACCCAAAGTATT 18 00 

Qy 184 0 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 1899 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1801 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 1860 


Qy 1900 TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1959 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 1861 TGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGA 1920 

Qy 1960 GAGGAT GCACAGTTT GCT ATTT GCTTTAGAGACAGGGACT GT ATAAACAAGCCTAACATT 2 019 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1921 GAGGAT GCACAGTTT GCTATTTGCTTTAGAGACAGGGACT GT ATAAACAAGCCTAACATT 198 0 

Qy 2020 GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAAAAAAA 2 057 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1981 GGT GCAAAGAT T GC CT CT T GAAT T AAAAAAAAACT AGA 2 018 


RESULT 14 
US-10-032-818-1 

Sequence 1, Application US/10032818 
Publication No. US20030092629A1 
GENERAL INFORMATION: 
APPLICANT: Tang, Jordan J.N. 
APPLICANT: Koelsch, Gerald 
APPLICANT: Ghosh, Arun K. 

TITLE OF INVENTION: Inhibitors of Memapsin 2 and Use Thereof 
FILE REFERENCE: 2 932.1006-007 
CURRENT APPLICATION NUMBER: US/10/032, 818 
CURRENT FILING DATE: 2001-12-28 
PRIOR APPLICATION NUMBER: US 60/275,756 
PRIOR FILING DATE: 2001-03-14 
PRIOR APPLICATION NUMBER: US 60/258,705 
PRIOR FILING DATE: 2000-12-28 
NUMBER OF SEQ ID NOS : 8 3 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 1 
LENGTH: 3252 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-032-818-1 

Query Match 97.1%; Score 2010; DB 15; Length 3252; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2013; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 

Qy 40 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 99 

I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 GCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGC 60 

Qy 100 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 159 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 61 CTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAG 120 

Qy 160 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 219 

I II I I I I I I I I I I II I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I M I I I I I I I I I I I 

Db 121 CCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAG 18 0 

Qy 220 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 27 9 

I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 GGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGAT 24 0 


Qy 280 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 339 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I 
Db 241 ACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAC 300 

Qy 34 0 CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 399 

I I I M M I I II I I I I I I I I I M I I I I I I I I I I I M i I M M M II M II I I I M II I II I 

Db 301 CAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACC 360 

Qy 400 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 459 

I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I E I I I I I I 1 I I I I I I I I I I I 

Db 361 CAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAAC 420 

Qy 4 60 GTCACTGTGCGTGCCAACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGC 519 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 421 GTC ACTGT GC GTGC CAACATT GCTGC CAT CACT GAAT CAGACAAGTT CT T CAT C AAC GGC 4 80 

Qy 52 0 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 579 

I I I I I I M II I I M I I I I M I I I I M I I I I 1 I I ! | M I I I I I I I I I I I I I 1 I I I I I i I I I 

Db 481 TCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCC 540 

Qy 58 0 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 639 

M I I I i I I I I ! I I I I I M I M I M I M I I I I I I I I I I I I M I I M I I I I I I 11 I I I I I I I 
Db 541 CTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTG 600 

Qy 64 0 CACCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 699 

II I I I I I M I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 601 CAGCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGG 660 

Qy 700 AGCAT GAT CATT GGAGGT AT C GAC C ACT C GCT GT ACACAGGCAGT CT CT G GTAT ACACC C 759 

I I I I I I I I I I ! I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I i I I I I I 

Db 661 AGCAT GAT CATT GGAGGT AT C GAC CACT CGCT GT ACACAGGCAGT CT CT GGTAT ACACC C 720 

Qy 7 60 AT C C GGC GG GAGT GGTAT TAT GAGGT CAT CAT T GT GC GGGT G GAGAT CAAT GGACAGGAT 819 

I I I I I I M I I I I I II I I II I I I I II I I II I II t II II I I I I I M I I I M M M I I II I I 

Db 721 AT C C GGC G G GAGT GGTAT TAT GAGGT GAT CAT T GT G C GGGT GGAGAT CAAT GGACAGGAT 7 80 

Qy 820 CT GAAAAT GGACT G CAAG GAGT ACAACT AT GACAAGAGCATT GT GGACAGT GGCAC CAC C 879 

I I I I I M I I I M I I M I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I II I I I I I I 
Db 7 81 CT GAAAAT GGACT G CAAGGAGTACAACTAT GACAAGAGCAT T GT GGACAGT GGCACCAC C 84 0 

Qy 880 AACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCC 939 

I I I I I I I M I I I I I I I I I I I I I I I I I I II I | I M M I I I I II I I I I I I I I I I I I II I I M 
Db 841 AACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCC 900 

Qy 940 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 999 

I I I I II I I I M I I I I I I I I I M I I I I II I I I I II I I I I II I I I | I I I M I | | I | | II | | I 
Db 901 TCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCA 960 

Qy 1000 GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 1059 

I I I I I M I I I I I I I I I M I M I I I I I I I II I I I I I I I I II I I I I I I I I M I I I I I I I I I I 
Db 961 GGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACC 1020 

Qy 1060 AAC CAGT C CT T C C G CAT CAC CAT C C T T C C GC AGC AATAC C T GC GGC C AGT GGAAGAT GT G 1119 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I 
Db 1021 AACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTG 1080 


Qy 112 0 GCCACGT CCCAAGACGACT GTTACAAGTTT GCCAT CTCACAGTCAT CCACGGGCACT GTT 117 9 


1081 GCCACGTCCCAAGACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTT 114 0 

118 0 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1239 

| I | I I I I I I I I I I M I I I I I I I I I I I I I I I I I I > I I I I I I M I I I I I I I I I I I I I I I I I I 
1141 ATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATT 1200 

124 0 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 1299 

I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I M N I 
1201 GGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGC 1260 

130 0 C CT TTT GT CAC C T T GGAC AT GGAAGACT GT GG CT ACAAC AT T C C ACAGACAGAT GAGT CA 1359 

| | I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 61 C CT T T T GT CAC CT T GGAC AT GGAAGACT GT GG CT ACAAC AT T C C AC AGAC AGAT GAGT CA 1320 

1360 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1419 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I 
132 1 ACCCTCATGACCATAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTC 1380 

142 0 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 147 9 

| I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M II I I I I I I I I I I I I II I I I 
1381 TGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTT 144 0 

14 8 0 G CT GAT GACAT CT C C CT GCT GAAGT GAGGAGG C C CAT G G GCAGAAGAT AGAGAT T CC CCT 1539 

| I I M | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I 
14 41 GCT GAT GACAT CT C C CT GCT GAAGT GAGGAGGC C CAT GGGCAGAAGAT AGAGATT CCC CT 1500 

154 0 GGACCACACCTCCGTGGTTCACTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGC 1599 

I I I M i I I I I I I I I I I I I I I M I I I I I I ! I I I I i M I I I I I I I I I I I I I I I M I I I I I I I 

1501 G GAC CACAC CT C C GT GGTT CACTT T GGT C ACAAGT AGGAGAC ACAGAT GGCACCT GT GGC 15 60 

1600 CAGAGCAC CT C AGGAC CCT CC C CAC C CAC CAAAT GC CTCTGCCTT GAT GGAGAAGGAAAA 1659 

I I I I I I I M | I I I I I II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
1561 CAGAGCAC CT C AGGAC CCTC C C CAC C CAC CAAAT GC CT CT GC CT T GAT GGAGAAGGAAAA 1620 

1660 GGCTGGCAAGGTGGGTTCCAGGGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAG 1719 

| I I I II I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I 
1621 GGCT GGCAAGGT GGGTT CCAGGGACT GT ACCT GTAGGAAACAGAAAAGAGAAGAAAGAAG 168 0 

172 0 CACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCT 1779 

| | I I I I I I M I I I I I I I I I M I I I I II I I I I I I II I II I I M I I I I I I I I I I II I I I I I I 
1681 CACT CT GCT GGCGGGAAT ACT CTT GGT CACCT CAAATTT AAGT CGGGAAATT CT GCT GCT 17 4 0 

17 8 0 TGAAACTTCAGCCCTGAACCTTTGTCCACCATTCCTTTTW^TTCTCCAACCCAT^AGTATT 18 39 

I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
1741 T G AAAC T T C AG C C C T G AAC C T T T GT C CAC CAT T C CT T T AAAT T C T C C AAC C C AAAGT AT T 18 00 

184 0 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 99 

| | || | | | | || || || I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

1801 CTTCTTTTCTTAGTTTCAGAAGTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCC 18 60 

1900 T GT GGTACCCTGGCAGAGAAGAGAC CAAGCTT GTTT CCCTGCTGGCCAAAGTCAGT AGGA 1959 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M 

1861 T GT GGT ACCCT GGC AGAGAAGAGAC CAAGCT T GTTTCCCTGCT G GC CAAAGT CAGT AGGA 1920 

1960 GAGGATGCACAGT T T GCT AT T T GCT T T AGAGAC AGGGACT GT AT AAACAAGC CT AACAT T 2019 
| I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 


Db 1921 GAGGAT GCAC AGT TT GCT AT T T GCTT T AGAGAC AGGGACT GT ATAAACAAG C CT AAC AT T 198 0 


Qy 2020 GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAAAAAAA 2057 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I II M I I 
Db 1981 GGTGCAAAGATTGCCTCTTGAATTAAAAAAAAACTAGA 2018 
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Qy 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I M M I I I M I I II I 

Db 1 ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 60 

Qy 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 120 

Qy 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I M I I I I I I I II I I I 
Db 121 CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 18 0 


Qy 181 GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 24 0 

I I I I I I I I I I I I I I I II I I I I I I I M I II II I I I I I I I I I I I I I I I I I I I I I I > I I I I I I 
Db 181 GT GGAGAT GGT GGACAAC CT GAGGGGCAAGT C G G G G CAGGGCT ACT AC GT G GAGAT GAC C 240 

Qy 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 300 

Qy 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

I | | | | || | | | I I M I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I II I I I I I I I I I I I 

Db 301 GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 360 

Qy 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 420 

I || M | | | | I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 42 0 

Qy 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 480 

I | | | M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I M I I I I I I I I I 

Db 421 CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCC7UVCATT 4 80 

Qy 481 GCT GC CAT CACTGAAT CAGACAAGTT CTT CAT CAACGGCT CCAACT GGGAAGGCAT CCT G 540 

I II I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I 
Db 481 GCT GCCAT CACTGAAT CAGACAAGTT CTT CAT CAACGGCT CCAACT GGGAAGGCAT CCT G 54 0 

Qy 541 GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 600 

I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 541 GGGCTGGCCTATGCTGAGATTGCCAGG — 5 67 

Qy 601 CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCACCTTTGTGGTGCTGGCTTC 660 

I I I I I I I I I I I I I I I I I I 

Db 568 CTTTGTGGTGCTGGCTTC 585 

Qy 661 C C C CT CAAC CAGT CT GAAGT GCT GGC CTCT GT C GGAGG GAG CAT GAT CAT T G GAGGT AT C 720 

I I I I I I I I I M I I -II II I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II 

Db 586 CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 645 

Qy 721 GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 7 80 

I | | | | | | I I I I I M I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M 

Db 646 GACCACT CGCT GTACACAGGCAGT CT CTGGTATACACCCAT CCGGCGGGAGT GGT AT TAT 705 

Qy 781 GAGGT CAT C ATT GT GC GGGT G GAGAT CAAT GGAC AGGAT CT GAAAAT G GACT GCAAG GAG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7 06 GAGGT GAT CAT T GT GC GGGT GGAGAT CAATGGAC AGGAT CT GAAAAT GGACT GCAAGGAG 765 

Qy 8 41 T ACAACT AT GACAAGAGCATT GT GGACAGT GGC AC C AC CAAC CTT CGT T T GC C CAAGAAA 900 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 766 T ACAACT AT GACAAGAGCAT T GT GGACAGT GGC AC CAC CAAC CTTCGTTTGCC CAAGAAA 825 

Qy 901 GT GT T T GAAG CT GCAGT CAAAT C CAT CAAGGC AGC CT C CT C CAC GGAGAAGT T C CCT GAT 960 

I | | I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 826 GT GT TT GAAGCT GCAGT CAAAT C CAT CAAGGCAG C CT C CT C CAC GGAGAAGT T C C CT GAT 885 

Qy 961 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT .1020 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I II I I I I I I I I I I I I I I I II 
Db 8 86 GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 945 


Qy 

1021 

Db 

946 

Qy 

1081 

Db 

1006 

Qy 

1141 

Db 

1066 

Qy 

1201 

Db 

1126 

Qy 

1261 

Db 

1186 

Qy 

1321 

Db 

1246 

Qy 

1381 

Db 

1306 

Qy 

1441 

Db 

1366 

Qy 

1501 

Db 

1426 

Qy 

1561 

Db 

1486 

Qy 

1621 

Db 

1546 

Qy 

1681 

Db 

1606 

Qy 

1741 

Db 

1666 

Qy 

1801 

Db 

1726 

Qy 

1861 


TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 1080 

I | M | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I M M I I I I 

T T C C C AGT CAT C T CACT CT AC CT AAT G G GT GAGGT T AC CAAC C AGT C CT T C C GCATCACC 1005 

AT C CT T C C GC AGCAAT AC CT GCGGCCAGT GGAAGAT GT GG C CAC GT C C CAAGACGACT GT 114 0 

M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

AT C CT T C C GCAG CAAT AC CT G CG GCCAGT G GAAGAT GT G GC C AC GT C C CAAGAC GACT GT 1065 
T ACAAGT T T GC C AT CT CAC AGT CATCC AC GG GC ACT GT TAT GGGAGC T GT TAT CAT G GAG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I II I I I I I I I I I I I I I 

T ACAAGT T T GC CAT CT C AC AGT CAT CCACGGG C ACT GTT AT G GGAGCT GT TAT CAT GGAG 1125 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1260 

I | | | | | | M | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 1185 

CAT GT GC AC GAT GAGT T C AGGAC GGCAGCGGT GGAAG GC C CT T TT GT C AC CT T GGAC AT G 1320 

| | | | | | | | | | | | II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATG 1245 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCCTAT 1380 

I | | | | | || | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I 

GAAGACT GT GGC T ACAACAT T C CACAGACAGAT GAGT CAAC C CT CAT GAC C AT AGC CT AT 1305 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1440 

I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGG 1365 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 150 0 

M I I I I I M I I I I II I I I I I I I I I I I I I ! I I M I I I I I I II I I I II I I I I II I I II I I I I 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 142 5 

AAGT GAGGAGGC C CATGGGCAGAAGATAGAGATTCCCCTGGACCACACCT CCGTGGTT CA 1560 
I I I I M I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I M I I I II I 
AAGT GAGGAGGCC CAT GGGCAGAAGAT AGAGATTCCCCTGGACCACACCT CCGT GGTT CA 14 8 5 

CTT T G GT C ACAAGT AG G AGACACAGAT GGCAC CT GT GGCCAGAGCAC CT C AGGAC C CT C C 162 0 

| | | | | | | |. | | | | | | | I I I I I I I I I I I I I I I II I I I I II II I I I I I I I I I I I M i I I I I II 

CTTT GGT CACAAGT AGGAGACACAGAT GGCACCTGTGGCCAGAGCACCTCAGGACCCT CC 154 5 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 168 0 
I M I I I I I I I I I I I I I I I I M 1 I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 1605 

GGACT GT AC CT GT AGGAAAC AGAAAAG AGAAGAAAGAAGCACT CT G CT GGC GGGAAT ACT 17 4 0 

I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I M 

GGACT GT ACCT GT AGGAAAC AGAAAAGAGAAGAAAGAAGCACT CT GCT GGC GG GAAT ACT 1665 

CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 18 00 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 1725 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 18 60 

M I M I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGTTTCAGAA 17 85 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1920 


Db 17 8 6 GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 1845 

Qy 1921 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 198 0 

I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 184 6 AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 1905 

Qy 1981 T GC TTT AGAGACAGGGACT GTAT AAACAAG C CTAAC AT T GGT G C AAAGAT T GC CT CT T GA 2 04 0 

I I I I II I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I 

Db 1906 T GC TTT AG AGAC AGGGACT GTAT AAACAAG C CTAAC AT T GGT GCAAAGAT T GC C T CT T GA 1965 

Qy 2041 ATTAAAAAAAAA 2052 

I I I I II I I I I 
Db 1966 AAAAAAAAAAAA 1977 


Search completed: January 21, 2004, 20:26:49 
Job time : 707 sees 


